HIGH FIDELITY REVERSE TRANSCRIPTASES AND USES THEREOF 



Cross Reference to Related Applications 

The present application claims the benefit of U.S. Provisional 
Application No. 60/189,454, filed March 15, 2000, the contents of which are 
incorporated by reference herein in their entirety. 

Field of the Invention 

[0001] The present invention is in the fields of molecular and cellular biology. 

The invention is generally related to reverse transcriptase enzymes and 
methods for the reverse transcription of nucleic acid molecules, especially 
messenger RNA molecules. Specifically, the invention relates to reverse 
transcriptase enzymes which have been mutated or modified to increase 
fidelity, and/or decrease terminal deoxynucleotidyl transferase activity, and to 
methods of producing, amplifying or sequencing nucleic acid molecules 
(particularly cDNA molecules) using these reverse transcriptase enzymes or 
compositions. The invention also relates to nucleic acid molecules produced 
by these methods and to the use of such nucleic acid molecules to produce 
desired polypeptides. The invention also concerns kits comprising such 
enzymes or compositions. 

BACKGROUND OF THE INVENTION 

cDNA and cDNA Libraries 

[0002] In examining the structure and physiology of an organism, tissue or 

cell, it is often desirable to determine its genetic content. The genetic 
framework of an organism is encoded in the double-stranded sequence of 
nucleotide bases in the deoxyribonucleic acid (DNA) which is contained in the 
somatic and germ cells of the organism. The genetic content of a particular 
segment of DNA, or gene, is only manifested upon production of the protein 
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which the gene encodes. In order to produce a protein, a complementary copy 
of one strand of the DNA double helix (the "coding" strand) is produced by 
polymerase enzymes, resulting in a specific sequence of ribonucleic acid 
(RNA). This particular type of RNA, since it contains the genetic message 
from the DNA for production of a protein, is called messenger RNA (mRNA). 

[0003] Within a given cell, tissue or organism, there exist a myriad of mRNA 

species, each encoding a separate and specific protein. This fact provides a 
powerful tool to investigators interested in studying genetic expression in a 
tissue or cell - mRNA molecules may be isolated and further manipulated by 
various molecular biological techniques, thereby allowing the elucidation of 
the full functional genetic content of a cell, tissue or organism. 

[0004] One common approach to the study of gene expression is the 

production of complementary DNA (cDNA) clones. In this technique, the 
mRNA molecules from an organism are isolated from an extract of the cells or 
tissues of the organism. This isolation often employs solid chromatography 
matrices, such as cellulose or agarose, to which oligomers of thymidine (T) 
have been complexed. Since the 3' termini on most eukaryotic mRNA 
molecules contain a string of adenosine (A) bases, and since A binds to T, the 
mRNA molecules can be rapidly purified from other molecules and substances 
in the tissue or cell extract. From these purified mRNA molecules, cDNA 
copies may be made using the enzyme reverse transcriptase (RT), which 
results in the production of single-stranded cDNA molecules. The 
single-stranded cDNAs may then be converted into a complete 
double-stranded DNA copy {i.e., a double- stranded cDNA) of the original 
mRNA (and thus of the original double-stranded DNA sequence, encoding 
this mRNA, contained in the genome of the organism) by the action of a DNA 
polymerase. The protein-specific double-stranded cDNAs can then be inserted 
into a plasmid or viral vector, which is then introduced into a host bacterial, 
yeast, animal or plant cell. The host cells are then grown in culture media, 
resulting in a population of host cells containing (or in many cases, 
expressing) the gene of interest. 
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[0005] This entire process, from isolation of mRNA to insertion of the cDNA 

into a plasmid or vector to growth of host cell populations containing the 
isolated gene, is termed "cDNA cloning." If cDNAs are prepared from a 
number of different mRNAs, the resulting set of cDNAs is called a "cDNA 
library," an appropriate term since the set of cDNAs represents a "population" 
of genes comprising the functional genetic information present in the source 
cell, tissue or organism. Genotypic analysis of these cDNA libraries can yield 
much information on the structure and function of the organisms from which 
they were derived. 

Retroviral Reverse Transcriptase Enzymes 

[0006] Three prototypical forms of retroviral RT have been studied 

thoroughly. Moloney Murine Leukemia Virus (M-MLV) RT contains a single 
subunit of 78 kDa with RNA-dependent DNA polymerase and RNase H 
activity. This enzyme has been cloned and expressed in a fully active form in 
E. coli (reviewed in Prasad, V.R., Reverse Transcriptase, Cold Spring Harbor, 
New York: Cold Spring Harbor Laboratory Press, p. 135 (1993)). Human 
Immunodeficiency Virus (HIV) RT is a heterodimer of p66 and p51 subunits 
in which the smaller subunit is derived from the larger by proteolytic cleavage. 
The p66 subunit has both a RNA-dependent DNA polymerase and an RNase 
H domain, while the p51 subunit has only a DNA polymerase domain. Active 
HIV p66/p51 RT has been cloned and expressed successfully in a number of 
expression hosts, including E. coli (reviewed in Le Grice, S.F.J., Reverse 
Transcriptase, Cold Spring Harbor, New York: Cold Spring Harbor 
Laboratory press, p. 163 (1993)). Within the HIV P 66/p51 heterodimer, the 
51-kD subunit is catalytically inactive, and the 66-kD subunit has both DNA 
polymerase and RNase H activity (Le Grice, S.F.J., et al, EMBO Journal 
10:3905 (1991); Hostomsky, Z., et al., J. Virol. 66:3179 (1992)). Avian 
Sarcoma-Leukosis Virus (ASLV) RT, which includes but is not limited to 
Rous Sarcoma Virus (RSV) RT, Avian Myeloblastosis Virus (AMV) RT, 
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Avian Erythroblastosis Virus (AEV) Helper Virus MCAV RT, Avian 
Myelocytomatosis Virus MC29 Helper Virus MCAV RT, Avian 
Reticuloendotheliosis Virus (REV-T) Helper Virus REV-A RT, Avian 
Sarcoma Virus UR2 Helper Virus UR2AV RT, Avian Sarcoma Virus Y73 
Helper Virus YAV RT, Rous Associated Virus (RAV) RT, and Myeloblastosis 
Associated Virus (MAV) RT, is also a heterodimer of two subunits, a 
(approximately 62 kDa) and p (approximately 94 kDa), in which a is derived 
from p by proteolytic cleavage (reviewed in Prasad, V.R., Reverse 
Transcriptase, Cold Spring Harbor, New York: Cold Spring Harbor 
Laboratory Press (1993), p. 135). ASLV RT can exist in two additional 
catalytically active structural forms, pp and a (Hizi, A. and Joklik, W.K., J. 
Biol. Chem. 252: 2281 (1977)). Sedimentation analysis suggests ap and PP 
are dimers and that the a form exists in an equilibrium between monomeric 
and dimeric forms (Grandgenett, D.P., et al, Proc. Nat. Acad. Set USA 70: 
230 (1973); Hizi, A. and Joklik, W.K., J. Biol. Chem. 252: 2281 (1977); and 
Soltis, D.A. and Skalka, A.M., Proc. Nat. Acad. Sci. USA 85: 3372 (1988)). 
The ASLV ap and pp RTs are the only known examples of retroviral RT that 
include three different activities in the same protein complex: DNA 
polymerase, RNase H, and DNA endonuclease (integrase) activities (reviewed 
in Skalka, A.M., Reverse Transcriptase, Cold Spring Harbor, New York: Cold 
Spring Harbor Laboratory Press (1993), p. 193). The a form lacks the 
integrase domain and activity. 
[0007] Various forms of the individual subunits of ASLV RT have been 

cloned and expressed. These include a 98-kDa precursor polypeptide that is 
normally processed proteolytically to p and a 4-kDa polypeptide removed 
from the p carboxy end (Alexander, F., et al, J. Virol. 61: 534 (1987) and 
Anderson, D. et al, Focus 77:53 (1995)), and the mature p subunit (Weis, J.H. 
and Salstrom, J.S., U.S. Patent No. 4,663,290 (1987); and Soltis, D.A. and 
Skalka, A.M., Proc. Nat. Acad. Sci. USA 85:3312 (1988)). Heterodimeric 



RS V ocp RT has also been purified from E. coli cells expressing a cloned RSV 
(3 gene (Chernov, A.P., et al, Biomed ScL 2:49 (1991)). 

Various domains of the reverse transcriptases have also been identified, 
for example, the thumb, fingers and palm regions. The thumb region is of 
particular importance in that, mutations in this region have been shown to reduce 
the incidence of frame shifting. 

Reverse Transcription Efficiency and Fidelity 

[0008] As noted above, the conversion of mRNA into cDNA by RT-mediated 

reverse transcription is an essential step in the study of proteins expressed 
from cloned genes. However, the use of unmodified RT to catalyze reverse 
transcription is inefficient for at least two reasons. First, RT sometimes 
renders an RNA template unable to be copied before reverse transcription is 
initiated or completed, primarily due to the intrinsic RNase H activity present 
in RT. Second, RTs generally have low fidelity. That is, RTs incorporate 
mismatched bases during cDNA synthesis thus producing cDNA products 
having sequence errors. RTs have in fact been shown to incorporate one base 
error per 3000-6000 nucleotides for HIV RT, and 1/10,000 nucleotide for 
AMV RT during cDNA synthesis (Berger, S.L., et al, Biochemistry 
22:2365-2372 (1983); Krug, M.S., and Berger, S.L., Meth. Enzymol 752:316 
(1987); Berger et al Meth Enzymol 275: 523 (1996)). 

[0009] Removal of the RNase H activity of RT can eliminate the first problem 

and improve the efficiency of reverse transcription (Gerard, G.F., et al, 
FOCUS 11(4):6Q (1989); Gerard, G.F., et al, FOCUS 14(3):9l (1992)). 
However such RTs ("RNase H' n forms) do not address the second problem of 
improving the fidelity of reverse transcription. The present invention 
addresses this need. 
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[0010] The present invention provides reverse transcriptase enzymes, 

compositions comprising such enzymes and methods useful in overcoming the 
efficiency limitations of reverse transcription. In general, the invention 
provides compositions for use in high fidelity reverse transcription of a nucleic 
acid molecule comprising one or more polypeptides having reverse 
transcriptase activity of the invention. Such compositions may further 
comprise one or more nucleotides, a suitable buffer, and/or one or more DNA 
polymerases. The compositions of the invention may also comprise one or 
more oligonucleotide primers. 

[0011] The reverse transcriptases of the invention are preferably modified or 

mutated such that the fidelity of the enzyme is increased or enhanced. 
Additional embodiments of the invention include reverse transcriptases that 
are modified to decrease or eliminate terminal deoxynucleotidyl transferase 
(TdT) activity. The reverse transcriptases of the invention are preferably 
single chained (single-subunit) or multi-chained (multi-subunit), and are 
reduced or substantially reduced in RNase H activity, and most preferably are 
enzymes selected from the group consisting of Moloney Murine Leukemia 
Virus (M-MLV) FT reverse transcriptase, Rous Sarcoma Virus (RSV) H" 
reverse transcriptase, Avian Myeloblastosis Virus (AMV) H" reverse 
transcriptase, Rous Associated Virus (RAV) FT reverse transcriptase, 
Myeloblastosis Associated Virus (MAV) H" reverse transcriptase and Human 
Immunodeficiency Virus (HIV) H" reverse transcriptase or other ASLV H" 
reverse transcriptases. In preferred compositions, the reverse transcriptases 
are present at working concentrations. 

[0012] Enzymes of the invention include reverse transcriptases which exhibit 

reverse transcriptase activity either upon the formation of multimers (e.g., 
dimers) or as individual protein molecules (i.e., in monomeric form). 
Examples of reverse transcriptases which exhibit reverse transcriptase activity 
upon the formation of multimers include AMV, RSV and HIV reverse 



transcriptases. An example of a reverse transcriptases which exhibits reverse 
transcriptase activity as separate, individual proteins include M-MLV and 
RSV reverse transcriptase. 

[0013] Multimeric reverse transcriptases of the invention may form homo- 

multimers or hetero-multimers. In other words, the subunits of the multimeric 
protein complex may be identical or different. One example of a hetero- 
dimeric reverse transcriptase is AMV reverse transcriptase, which is composed 
of two subunits that differ in primary amino acid sequence. More specifically, 
as already discussed, AMV reverse transcriptase may be composed of two 
subunits wherein one of these subunits is generated by proteolytic processing 
of the other. Thus, dimeric AMV reverse transcriptase may be composed of 
subunits of differing size that share regions of amino acid sequence identity. 

[0014] The present invention relates in particular to mutant or modified 

reverse transcriptases wherein one or more amino acid changes have been 
made which renders the enzyme more faithful (higher fidelity) in nucleic acid 
synthesis. The preferred sites for mutation or modification to produce higher 
fidelity polymerases are listed for some reverse transcriptases in Table 1. 
Similar or equivalent sites or corresponding sites in other reverse trancriptases 
can be mutated to produce higher fidelity reverse transcriptases. 

Table 1: 
RT Amino acid 

M-MLV Y64, Rl 16, K152, Q190, T197, V223, D124, H126, Y133 
AMV W25, R76, Kl 10, Q149, T156, Ml 82 

RSV W25, R76, K110, Q149, T156, Ml 82 

HIV W24, R78, Gl 12, Q151, A158, M184 

The invention further includes M-MLV RT having the following mutations: 

V223H, Q190F, T197A, T197E, Y64W, Rl 16M and K152R as well as other RTs 

having corresponding mutations or modifications. 
[0015] The present invention is also directed to mutant or modified reverse 

transcriptases wherein one or more amino acid changes have been made which 



decrease or eliminate terminal deoxy nucleotidyl transferase (TdT) activity. 
The preferred sites for these mutations include, but are not limited to, F309, 
T197 and Y133 of M-MLV RT. Specific mutations or modifications of 
M-MLV RT include T197E, which reduces TdT activity to a level 
undetectable by assay methods described herein; and T197A, which reduces 
TdT activity, to a lesser extent. Similar or equivalent sites or corresponding 
sites in other reverse transcriptases can be mutated to produce reverse 
transcriptases with reduced, substantially reduced or eliminated TdT activities. 
Examples of such equivalent sites include, but are not limited to, W266 and 
194 in HIV RT, W267 and A95 in AMV RT, and W267 and A95 in RSV RT. 
[0016] In specific embodiments, reverse transcriptases of the invention may 

not include M-MLV RT, HIV RT, AMV RT, or RSV RT. Thus, for example, 
in certain embodiments the invention includes RTs with increased fidelity that 
are not HIV RTs. 

[0017] The present invention is also directed to DNA molecules (preferably 

vectors) containing a gene or nucleic acid molecule encoding the mutant or 
modified reverse transcriptases of the present invention and to host cells 
containing such DNA molecules. Any number of hosts may be used to 
express the gene or nucleic acid molecule of interest, including prokaryotic 
and eukaryotic cells. Preferably, prokaryotic cells are used to express the 
polymerases of the invention. The preferred prokaryotic host according to the 
present invention is E. coli. 

[0018] The invention also relates to a method of producing the reverse 

transcriptases of the invention, said method comprising: 

(a) culturing the host cell comprising a gene or nucleic acid molecule 
encoding a reverse transcriptase of the invention (preferably such RT gene is 
contained by a vector within the host cell); 

(b) expressing said gene or nucleic acid molecule; and 

(c) isolating said reverse transcriptase from said host cell. 

[0019] The invention is also directed to methods for making one or more 

nucleic acid molecules, comprising mixing one or more nucleic acid templates 
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(preferably one or more RNA templates and most preferably one or more 
messenger RNA templates) with one or more reverse transcriptases of the 
invention and incubating the mixture under conditions sufficient to make a 
first nucleic acid molecule or molecules complementary to all or a portion of 
the one or more nucleic acid templates. In a preferred embodiment, the first 
nucleic acid molecule is a single- stranded cDNA. Nucleic acid templates 
suitable for reverse transcription according to this aspect of the invention 
include any nucleic acid molecule or population of nucleic acid molecules 
(preferably RNA and most preferably mRNA), particularly those derived from 
a cell or tissue. In a preferred aspect, a population of mRNA molecules (a 
number of different mRNA molecules, typically obtained from cells or tissue) 
are used to make a cDNA library, in accordance with the invention. Preferred 
cellular sources of nucleic acid templates include bacterial cells, fungal cells, 
plant cells and animal cells. 
[0020] The invention also relates to methods for making one or more 

double-stranded nucleic acid molecules. Such methods comprise (a) mixing 
one or more nucleic acid templates (preferably RNA or mRNA, and more 
preferably a population of mRNA templates) with one or more reverse 
transcriptases of the invention; (b) incubating the mixture under conditions 
sufficient to make a first nucleic acid molecule or molecules complementary 
to all or a portion of the one or more templates; and (c) incubating the first 
nucleic acid molecule or molecules under conditions sufficient to make a 
second nucleic acid molecule or molecules complementary to all or a portion 
of the first nucleic acid molecule or molecules, thereby forming one or more 
double-stranded nucleic acid molecules comprising the first and second 
nucleic acid molecules. Such methods may include the use of one or more 
DNA polymerases as part of the process of making the one or more 
double-stranded nucleic acid molecules. The invention also concerns 
compositions useful for making such double-stranded nucleic acid molecules. 
Such compositions comprise one or more reverse transcriptases of the 
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invention and optionally one or more DNA polymerases, a suitable buffer, one 
or more primers, and/or one or more nucleotides. 

[0021] The invention also relates to methods for amplifying a nucleic acid 

molecule. Such amplification methods comprise mixing the double-stranded 
nucleic acid molecule or molecules produced as described above with one or 
more DNA polymerases and incubating the mixture under conditions 
sufficient to amplify the double-stranded nucleic acid molecule. In a first 
preferred embodiment, the invention concerns a method for amplifying a 
nucleic acid molecule, the method comprising (a) mixing one or more nucleic 
acid templates (preferably one or more RNA or mRNA templates and more 
preferably a population of mRNA templates) with one or more reverse 
transcriptases of the invention and with one or more DNA polymerases and (b) 
incubating the mixture under conditions sufficient to amplify nucleic acid 
molecules complementary to all or a portion of the one or more templates. 
Preferably, the reverse transcriptases are reduced or substantially reduced in 
RNase H activity and the DNA polymerases comprise a first DNA polymerase 
having 3 1 exonuclease activity and a second DNA polymerase having 
substantially reduced 3' exonuclease activity. The invention also concerns 
compositions comprising one or more reverse transcriptases of the invention 
and one or more DNA polymerases for use in amplification reactions. Such 
compositions may further comprise one or more nucleotides and/or a buffer 
suitable for amplification. The compositions of the invention may also 
comprise one or more oligonucleotide primers. 

[0022] The invention is also directed to nucleic acid molecules (particularly 

single- or double-stranded cDNA molecules) or amplified nucleic acid 
molecules produced according to the above-described methods and to vectors 
(particularly expression vectors) comprising these nucleic acid molecules or 
amplified nucleic acid molecules. 

[0023] The invention is also directed to recombinant host cells comprising the 

above-described nucleic acid molecules, amplified nucleic acid molecules or 
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vectors. Preferred such host cells include bacterial cells, yeast cells, plant 
cells and animal cells (including insect cells and mammalian cells). 

[0024] The invention is further directed to methods of producing a 

polypeptide encoded by the nucleic acid molecules produced by the methods 
of the invention. Such methods comprise culturing the above-described 
recombinant host cells and isolating the encoded polypeptide, and to a 
polypeptide produced by such methods. 

[0025] The invention also concerns methods for sequencing one or more 

nucleic acid molecules using the compositions or enzymes of the invention. In 
specific embodiments, such methods comprise (a) mixing a nucleic acid 
molecule (e.g., an RNA or DNA molecules) to be sequenced with one or more 
primers, one or more reverse transcriptases of the invention, one or more 
nucleotides and one or more terminating agents, such as one or more 
dideoxynucleoside triphosphates; (b) incubating the mixture under conditions 
sufficient to synthesize a population of nucleic acid molecules complementary 
to all or a portion of the one or more nucleic acid molecules to be sequenced; 
and (c) separating members of the population of nucleic acid molecules to 
determine the nucleotide sequence of all or a portion of the one or more 
nucleic acid molecules to be sequenced. 

[0026] In other embodiments, such methods comprise (a) mixing a nucleic 

acid molecule (e.g., an RNA or DNA molecule) to be sequenced with one or 
more primers, one or more reverse transcriptases of the invention, one or more 
nucleotides and one or more terminating agents, such as one or more 
dideoxynucleoside triphosphates; (b) incubating the mixture under conditions 
sufficient to synthesize a population of nucleic acid molecule complementary 
to all or a portion of the nucleic acid molecule to be sequenced; and (c) 
separating the population of nucleic acid molecules to determine the 
nucleotide sequence of all or a portion of the nucleic acid molecule to be 
sequenced. 

[0027] The invention is also directed to kits for use in the methods of the 

invention. Such kits can be used for making, sequencing or amplifying 
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nucleic acid molecules (single- or double-stranded). The kits of the invention 
comprise a carrier, such as a box or carton, having in close confinement 
therein one or more containers, such as vials, tubes, bottles and the like. In the 
kits of the invention, a first container contains one or more of the reverse 
transcriptase enzymes of the present invention. The kits of the invention may 
also comprise, in the same or different containers, one or more DNA 
polymerase (preferably thermostable DNA polymerases), one or more suitable 
buffers for nucleic acid synthesis and one or more nucleotides. Alternatively, 
the components of the kit may be divided into separate containers (e.g., one 
container for each enzyme and/or component). The kits of the invention also 
may comprise instructions or protocols for carrying out the methods of the 
invention. In preferred kits of the invention, the reverse transcriptases are 
modified or mutated such that the fidelity of cDNA synthesis is increased 
and/or enhanced, the reverse transcriptases are reduced or substantially 
reduced in RNase H activity, and are most preferably selected from the group 
consisting of M-MLV H" reverse transcriptase, RS V H" reverse transcriptase, 
AMV H" reverse transcriptase, RAV H~ reverse transcriptase, MAV H" reverse 
transcriptase and HIV H" reverse transcriptase. In additional preferred kits of 
the invention, the enzymes (reverse transcriptases and/or DNA polymerases) 
in the containers are present at working concentrations. 
[0028] Reverse transcriptases of the invention include any reverse 

transcriptase having (1) enhanced fidelity or (2) decreased or eliminated 
terminal deoxynucleotidyl transferase (TdT) activity. Reverse transcriptases 
may be single chained or multi-chained. Such reverse transcriptases include 
retroviral reverse transcriptases, bacterial reverse trancriptases, 
retrotransposon reverse trancriptases, and DNA polymerases having reverse 
trancriptase activity. Preferred reverse trancriptases of the invention include 
single- subunit reverse transcriptases (e.g., M-MLV RT) and multi-subunit 
reverse transcriptases (e.g., AMV RT) and preferably retroviral RTs. In 
particular, the invention relates to M-MLV RTs and ASLV RTs (such as 
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AMV-RT and RSV-RT). Such reverse trancriptases of the invention 
preferably have reduced or substantially reduced RNAse H activity. 
[0029] Other preferred embodiments of the present invention will be apparent 

to one of ordinary skill in light of the following drawings and description of 
the invention, and of the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0030] These and other features, aspects, and advantages of the present 

invention will become better understood with reference to the following 
description and appended claims, and accompanying drawings where: 

[0031] Fig. 1 represents a scanned phosphoimage, which shows misinsertion 

assay of Superscript II (1) and mutant proteins V223H (2), V223F (3), and 
R110M (4) with DNA template. The 32 P-labeled 18-mer primer annealed to a 
47-mer DNA template (5 nM) was extended by equal units of RT protein at 
37°C for 30 min as seen in the extension reactions with all four nucleotides. 
The extension reactions were also performed in the presence of only 3 
complementary dNTPs; minus dCTP, minus dATP, minus TTP, and minus 
dGTP. The extension reactions were analyzed by 6% denaturing gel 
electrophoresis. In this assay, the higher efficiency of elongation of 
terminated primer with only three nucleotides will reflect the lower fidelity of 
the M-MLV RNase H" protein assayed. T, fully extended primer. P, 
non-extended primer. 

[0032] Fig. 2 represents a scanned phosphoimage, which shows misinsertion 

assay of Superscript II (1) and mutant R116M (7). Also shown are 
ThermoScript™ I (2), and Superscript II mutants F155Y (3), K193T (4), 
F156H (5), D153N (6), and V223R (8). Reaction conditions same as Fig. 1. 

[0033] Fig. 3 represents a scanned prophoimage, which shows misinsertion 

assay of Superscript II (1), mutants V223H (2), Q190F(4), K152R (5), T197A 
(7), and Y64W (8), along with mutants V223I (3) and K193C (6). Reaction 
conditions same as Fig. 1. 
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[0034] Fig. 4 represents a scanned phosphoimage of an extension assay using 

(1) Superscript II RT, and (2) F309N. The [ 32 P]-labeled 18-mer primer 
annealed to a 47-mer DNA template (5 nM) was extended by equal units of 
RT at 37°C for 30 minutes as seen in the extension reactions with all 4 
nucleotides. The extension reactions were analyzed by denaturing 6% gel 
electrophoresis. P, non-extended primer. 

[0035] Fig. 5 represents a scanned phosphoimage showing a TdT extension 

assay of Superscript™ II (SS II) RT and the mutants F309N, T197E and 
Y133A. The [ 32 P]-labeled 1 8-mer primer annealed to a 47-mer DNA template 
(5 nM) was extended with decreasing units of RT (see Methods) at 37°C for 
30 min with all four nucleotides. The extension reactions were analyzed by 
denaturing 6% gel electrophoresis. In this assay, extension past the 47 
nucleotide templates is considered non-template directed addition or TdT 
activity. P, non-extended primer. 

[0036] Fig. 6 represents a scanned phosphoimage showing misinsertion assays 

of Superscript II RT (1) and mutant protein F309N RT (2) with DNA 
template. The [ P]-labeled 18-mer primer annealed to a 47-mer DNA 
template (5 nM) was extended by equal units of RT protein at 37°C for 30 min 
as seen in the extension reactions with all four nucleotides. The extension 
reactions were also performed in the presence of only 3 complementary 
dNTPs; minus dCTP, minus dATP, minus TTP, and minus dGTP. The 
extension reactions were analyzed by denaturing 6% gel electrophoresis. In 
this assay, the higher efficiency of elongation of terminated primer with only 
three nucleotides will reflect the lower fidelity of the Superscript II RT 
assayed. P, non-extended primer. 

[0037] Fig. 7 represents a scanned phosphoimage showing a misinsertion 

assay of Superscript II RT (1) and mutant protein T197A/F309N RT (2) and 
V223H/F309N with DNA template. The [ 32 P]-labeled 18-mer primer 
annealed to a 47-mer DNA template (5 nM) was extended by equal units of 
RT protein at 37°C for 30 min as seen in the extension reactions with all four 
nucleotides. The extension reactions were also performed in the presence of 
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only 3 complementary dNTPs; minus dATP, and minus dCTP. The extension 
reactions were analyzed by denaturing 6% gel electrophoresis. In this assay, 
the higher efficiency of elongation of terminated primer with only three 
nucleotides will reflect the lower fidelity of the Superscript II RT assayed. P, 
non-extended primer. 

DETAILED DESCRIPTION 

[0038] In the description that follows, a number of terms used in recombinant 

DNA, virology and immunology are extensively utilized. In order to provide a 
clearer and consistent understanding of the specification and claims, including 
the scope to be given such terms, the following definitions are provided. 

[0039] Cloning vector. A plasmid, cosmid or phage DNA or other DNA 

molecule which is able to replicate autonomously in a host cell, and which is 
characterized by one or a small number of restriction endonuclease recognition 
sites at which such DNA sequences may be cut in a determinable fashion 
without loss of an essential biological function of the vector, and into which 
DNA may be spliced in order to bring about its replication and cloning. The 
cloning vector may further contain a marker suitable for use in the 
identification of cells transformed with the cloning vector. Markers, for 
example, are tetracycline resistance or ampicillin resistance. 

[0040] Expression vector. A vector similar to a cloning vector but which is 

capable of enhancing the expression of a gene or nucleic acid molecules which 
has been cloned into it, after transformation into a host. The cloned gene or 
nucleic acid molecule is usually placed under the control of (i.e., operably 
linked to) certain control sequences such as promoter sequences. 
[0041] Recombinant host. Any prokaryotic or eukaryotic or microorganism 

which contains the desired cloned genes or nucleic acid molecule in an 
expression vector, cloning vector or any DNA molecule. The term 
"recombinant host" is also meant to include those host cells which have been 
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genetically engineered to contain the desired gene or nucleic acid molecule on 
the host chromosome or genome. 
[0042] Host. Any prokaryotic or eukaryotic microorganism that is the 

recipient of a replicable expression vector, cloning vector or any DNA 
molecule. The DNA molecule may contain, but is not limited to, a structural 
gene, a promoter and/or an origin of replication. 
[0043] Promoter. A DNA sequence generally described as the 5' region of a 

gene, located proximal to the start codon. At the promoter region, 
transcription of an adjacent gene(s) is initiated. 
[0044] Gene. A DNA sequence that contains information necessary for 

expression of a polypeptide or protein. It includes the promoter and the 
structural gene as well as other sequences involved in expression of the 
protein. 

[0045] Structural gene. A DNA sequence that is transcribed into messenger 

RNA that is then translated into a sequence of amino acids characteristic of a 
specific polypeptide. 

[0046] Operably linked. As used herein means that the promoter is 

positioned to control the initiation of expression of the polypeptide encoded by 
the structural gene or other nucleic acid molecule. 

[0047] Expression. Expression is the process by which a gene or other 

nucleic acid molecule produces a polypeptide. It includes transcription of the 
gene or nucleic acid molecule into messenger RNA (mRNA) and the 
translation of such mRNA into polypeptide(s). 

[0048] Substantially Pure. As used herein "substantially pure" means that 

the desired purified protein is essentially free from contaminating cellular 
contaminants which are associated with the desired protein in nature. 
Contaminating cellular components may include, but are not limited to, 
phosphatases, exonucleases, endonucleases or undesirable DNA polymerase 
enzymes. Prefered reverse transcriptases of the invention are substantially 
pure. 
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[0049] Primer. As used herein "primer" refers to a single-stranded 

oligonucleotide that is extended by covalent bonding of nucleotide monomers 
during amplification or polymerization of a DNA molecule, 

[0050] Template. The term "template" as used herein refers to a 

double-stranded or single-stranded nucleic acid molecule which is to be 
amplified, copied or sequenced. In the case of a double-stranded DNA 
molecule, denaturation of its strands to form single-stranded first and second 
strands is performed before these molecules may be amplified, copied or 
sequenced. A primer, complementary to a portion of a nucleic acid template is 
hybridized under appropriate conditions and the reverse transcriptase of the 
invention may then synthesize a DNA molecule complementary to said 
template or a portion thereof. The newly synthesized DNA molecule, 
according to the invention, may be equal or shorter in length than the original 
template. Mismatch incorporation during the synthesis or extension of the 
newly synthesized DNA molecule may result in one or a number of 
mismatched base pairs. Thus, the synthesized DNA molecule need not be 
exactly complementary to the template. 

[0051] Incorporating. The term "incorporating" as used herein means 

becoming a part of a DNA molecule or primer. 

[0052] Oligonucleotide. "Oligonucleotide" refers to a synthetic or natural 

molecule comprising a covalently linked sequence of nucleotides which are 
joined by a phosphodiester bond between the 3' position of the pentose of one 
nucleotide and the 5' position of the pentose of the adjacent nucleotide. 

[0053] Nucleotide. As used herein "nucleotide" refers to a 

base-sugar-phosphate combination. Nucleotides are monomeric units of a 
nucleic acid sequence (DNA and RNA) and deoxyribonucleotides are 
"incorporated" into DNA by DNA polymerases. The term nucleotide includes 
deoxyribonucleoside triphosphates such as dATP, dCTP, dITP, dUTP, dGTP, 
dTTP, or derivatives thereof. Such derivatives include, for example, 
[aSJdATP, 7-deaza-dGTP and 7-deaza-dATP. The term nucleotide as used 
herein also refers to dideoxyribonucleoside triphosphates (ddNTPs) and their 
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derivatives. Illustrated examples of dideoxyribonucleoside triphosphates 
include, but are not limited to, ddATP, ddCTP, ddGTP, ddlTP, and ddTTP. 
According to the present invention, a "nucleotide" may be unlabeled or 
detectably labeled by well known techniques. Detectable labels include, for 
example, radioactive isotopes, fluorescent labels, chemiluminescent labels, 
bioluminescent labels and enzyme labels. 

[0054] Hybridization. The terms "hybridization" and "hybridizing" refer to 

the pairing of two complementary single-stranded nucleic acid molecules 
(RNA and/or DNA) to give a double-stranded molecule. As used herein, two 
nucleic acid molecules may be hybridized, although the base pairing is not 
completely complementary. Accordingly, mismatched bases do not prevent 
hybridization of two nucleic acid molecules provided that appropriate 
conditions, well known in the art, are used. 

[0055] Terminal extension. Terminal extension, as used herein, refers to the 

ability of a reverse transcriptase (RT) to add additional bases on to the 3 * end 
of a newly synthesized cDNA strand beyond the 5' end of the mRNA 
template. The activity may add bases specifically (with a nucleotide bias) or 
randomly. 

[0056] Terminal extension activity is also known as terminal deoxynucleotidyl 

transferase (TdT) activity. A reverse transcriptase having decreased or 
eliminated TdT activity is defined as any reverse transcriptase having lower 
TdT activity than the specific activity of the corresponding unmutated, 
unmodified or wild type enzyme, particularly, less than about 75% of the 
specific activity of the corresponding unmutated, unmodified or wild type 
enzyme, less than about 50% of the specific activity of the corresponding 
unmutated, unmodified or wild type enzyme, less than about 25% of the 
specific activity of the corresponding unmutated, unmodified or wild type 
enzyme, less than about 15% of the specific activity of the corresponding 
unmutated, unmodified or wild type enzyme, less than 10% of the specific 
activity of the corresponding unmutated, unmodified or wild type enzyme, less 
than about 5% of the specific activity of the corresponding unmutated, 



- 19- 



unmodified or wild type enzyme, or less than about 1% of the specific activity 
of the corresponding unmutated, unmodified or wild type enzyme. 
Eliminated TdT activity is defined as a level of activity that is undetectable by 
the assay methods utilized in Example 3. 
[0057] Strand jumping. Strand jumping, as used herein, refers to a type of 

random mutation caused by an RT "skipping" more than one (e.g., two, five, 
ten, fifty, one-hundred, etc.) nucleotides on the mRNA template, resulting in a 
deletion of the corresponding nucleotides in the resulting cDNA. 
[0058] Hand domain. The hand domain, as used herein, refers to those 

amino acids which are in the area or areas that control the template, primer, or 
nucleotide interaction of the reverse transcriptase. This domain is further 
characterized by a group of three regions of secondary structure in a reverse 
transcriptase enzyme, the thumb, fingers and palm regions. The thumb 
domain is defined as residing between amino acids 240-315 of HIV RT, or 
between amino acids 280-355 of M-MLV RT. The fingers domain is defined 
as residing between amino acids 1-85 and 120-154 of HIV RT, or between 1- 
124 and 161-193 of M-MLV RT. The palm domain is defined as residing 
between amino acids 86-199 and 155-239 of HIV RT, or between amino acids 
125-160 and 193-279 of M-MLV RT. These areas are generally defined, and 
the amino acids defining the N-termini and C-termini are approximate. 
Corresponding regions can also be defined for other reverse transcriptases. 
[0059] Fidelity. Fidelity refers to the accuracy of polymerization, or the 

ability of the reverse transcriptase to discriminate correct from incorrect 
substrates, (e.g., nucleotides) when synthesizing nucleic acid molecules which 
are complementary to a template. The higher the fidelity of a reverse 
transcriptase, the less the reverse transcriptase misincorporates nucleotides in 
the growing strand during nucleic acid synthesis; that is, an increase or 
enhancement in fidelity results in a more faithful reverse transcriptase having 
decreased error rate or decreased misincorporation rate. 
[0060] A reverse transcriptase having increased/enhanced/higher fidelity is 

defined as a polymerase having any increase in fidelity, preferably about 1.5 to 
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about 10,000 fold, about 2 to about 5,000 fold, or about 2 to about 2000 fold 
(preferably greater than about 5 fold, more preferably greater than about 10 
fold, still more preferably greater than about 50 fold, still more preferably 
greater than about 100 fold, still more preferably greater than about 500 fold 
and most preferably greater than about 100 fold) reduction in the number of 
misincorporated nucleotides during synthesis of any given nucleic acid 
molecule of a given length compared to the control reverse trancriptase. 
Preferably, the mutant or modified RT of the invention is compared to the 
corresponding unmodified or wild-type RT to determine the relative 
enhancement or increase in fidelity. For example, a mutated reverse 
transcriptase may misincorporate one nucleotide in the synthesis of a nucleic 
acid molecule segment of 1000 bases compared to an unmutated reverse 
transcriptase misincorporating 1 0 nucleotides in the same size segment. Such 
a mutant reverse transcriptase would be said to have an increase of fidelity of 
10 fold. 

[0061] Fidelity can also be measured by the decrease in the incidence of frame 

shifting, as described in Example 5. A reverse transcriptase having increased 
fidelity is defined as a polymerase or reverse transcriptase having any increase 
in fidelity with respect to frame shifting, as compared to a control reverse 
transcriptase (e.g., a wild-type RT), for example, a reverse transcriptase 
having greater than about 1.5 fold increased fidelity with respect to frame 
shifting, having greater than about 5 fold increased fidelity with respect to 
frame shifting, having greater than about 10 fold increased fidelity with 
respect to frame shifting, having greater than about 20 fold increased fidelity 
with respect to frame shifting, having greater than about 30 fold increased 
fidelity with respect to frame shifting, or having greater than about 40 fold 
increased fidelity with respect to frame shifting. 

[0062] A reverse transcriptase having increased/enhanced/higher fidelity, with 

respect to frame shifting, can also be defined as a reverse transcriptase or 
polymerase having any increase in fidelity, such as about 1.5 to about 10,000 
fold, about 2 to about 5,000 fold, about 2 to about 2000 fold, about 1.5 to 
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about 40 fold, about 5 to about 40 fold, about 10 to about 40 fold, about 20 to 
about 40 fold, about 30 to about 40 fold, about 5 to about 30 fold, about 10 to 
about 30 fold, about 15 to about 30 fold, about 20 to about 30 fold, about 5 to 
about 20 fold, about 10 to about 20 fold, about 15 to about 20 fold, about 10 to 
about 100 fold, about 15 to about 100 fold, about 20 to about 100 fold, about 
30 to about 100 fold, or about 50 to about 100 fold. 
[0063] A reverse transcriptase having reduced misincorporation is defined 

herein as either a mutated or modified reverse transcriptase that preferably has 
about or less than 50%, or preferably about or less than 25%, more preferably 
about or less than 10% and most preferably about or less than 1% of relative 
misincorporation compared to the corresponding unmutated, unmodified or 
wild type enzyme. 

[0064] The fidelity or misincorporation rate of a reverse transcriptase can be 

determined by sequencing or by other methods known in the art (Eckert & 
Kunkel, 1990, Nuc. Acids Res., 3739-3744). In one example, the sequence of 
a DNA molecule synthesized by the unmutated and mutated reverse 
transcriptase can be compared to the expected (known) sequence. In this way, 
the number of errors (misincorporation or frame shifts) can be determined for 
each enzyme and compared. In another example, the unmutated and mutated 
reverse transcriptase may be used to sequence a DNA molecule having a 
known sequence. The number of sequencing errors (misincorporation or 
frame shifts) can be compared to determine the fidelity or misincorporation 
rate of the enzymes. Other means of determining the fidelity or 
misincorporation rate include a forward complementation assay using an RNA 
template as described below and previously in Boyer J.C. et al. Methods 
Enzymol. 275: 523 (1996), and are set out in the examples. Other methods of 
determining the fidelity or misincorporation rate will be recognized by one of 
skill in the art. 

[0065] In general, the invention provides compositions for use in reverse 

transcription of a nucleic acid molecule comprising a reverse transcriptase 
with one or more mutations or modifications which render the reverse 
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transcriptase more efficient, that is having higher fidelity. The invention also 
provides compositions for use in reverse transcription of a nucleic acid 
molecule comprising a reverse transcriptase with one or more mutations or 
modifications which decrease TdT activity. 

[0066] The enzymes in these compositions are preferably present in working 

concentrations and are reduced or substantially reduced in RNase H activity. 
Alternatively, the reverse transcriptases used in the compositions of the 
invention may have RNase H activity. Preferred mutated or modified reverse 
transcriptases are derived from M-MLV reverse transcriptase, HIV reverse 
transcriptase, RSV reverse transcriptase, AMV reverse transcriptase, RAV 
reverse transcriptase, and MAV reverse transcriptase or other ASLV reverse 
transcriptases or their corresponding RNase H" derivatives. 

[0067] In accordance with the invention, any number of mutations can be 

made to the RTs and in a preferred aspect, multiple mutations can be made to 
result in an additive fidelity increase. Such mutations include point mutations, 
frame shift mutations, deletions and insertions, with one or more point 
mutations preferred. Preferably, oligonucleotide directed mutagenesis is used 
to create the mutant polymerases which allows for all possible classes of base 
pair changes at any determined site along the encoding DNA molecule. In 
general, this technique involves annealing an oligonucleotide complementary 
(except for one or more mismatches) to a single stranded nucleotide sequence 
coding for the RT of interest. The mismatched oligonucleotide is then 
extended by DNA polymerase, generating a double-stranded DNA molecule 
which contains the desired change in sequence in one strand. The changes in 
sequence can of course result in the deletion, substitution, or insertion of an 
amino acid. The double-stranded polynucleotide can then be inserted into an 
appropriate expression vector, and a mutant or modified polypeptide can thus 
be produced. The above-described oligonucleotide directed mutagenesis can 
of course be carried out via PCR. 

[0068] The invention is also directed to methods for reverse transcription of 

one or more nucleic acid molecules comprising mixing one or more nucleic 



acid templates, which is preferably RNA or messenger RNA (mRNA) and 
more preferably a population of mRNA molecules, with a mutant reverse 
transcriptase of the present invention and incubating the mixture under 
conditions sufficient to make a nucleic acid molecule or molecules 
complementary to all or a portion of the one or more templates. To make the 
nucleic acid molecule or molecules complementary to the one or more 
templates, a primer (e.g., an oligo(dT) primer) and one or more nucleotides are 
used for nucleic acid synthesis in the 3' to 5 ' direction. Nucleic acid molecules 
suitable for reverse transcription according to this aspect of the invention 
include any nucleic acid molecule, particularly those derived from a 
prokaryotic or eukaryotic cell. Such cells may include normal cells, diseased 
cells, transformed cells, established cells, progenitor cells, precursor cells, 
fetal cells, embryonic cells, bacterial cells, yeast cells, animal cells (including 
human cells), avian cells, plant cells and the like, or tissue isolated from a 
plant or an animal (e.g., human, cow, pig, mouse, sheep, horse, monkey, 
canine, feline, rat, rabbit, bird, fish, insect, etc.). Such nucleic acid molecules 
may also be isolated from viruses. 
[0069] The invention further provides methods for amplifying or sequencing a 

nucleic acid molecule comprising contacting the nucleic acid molecule with a 
reverse transcriptase of the present invention. Preferred such methods 
comprise one or more polymerase chain reactions (PCRs). 

Sources of Reverse Transcriptases 

[0070] Enzymes for use in the compositions, methods and kits of the invention 

include any enzyme having reverse transcriptase activity. Such enzymes 
include, but are not limited to, retroviral reverse transcriptase, retrotransposon 
reverse transcriptase, hepatitis B reverse transcriptase, cauliflower mosaic 
virus reverse transcriptase, bacterial reverse transcriptase, Tth DNA 
polymerase, Taq DNA polymerase (Saiki, R.K., et ah, Science 239:487-491 
(1988;; U.S. Patent Nos. 4,889,818 and 4,965,188), Tne DNA polymerase 
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(WO 96/10640), Tma DNA polymerase (U. S. Patent No. 5,374,553) and 
mutants, fragments, variants or derivatives thereof {see, e.g., commonly 
owned U.S. Patent Nos. 5,948,614 and 6,015,668, which are incorporated by 
reference herein in their entireties). Preferred reverse transcriptases for use in 
the invention include M-MLV RT, AMV RT, RSV RT, RAV RT, MAV RT 
and generally ASLV reverse transcriptases. As will be understood by one of 
ordinary skill in the art, modified reverse transcriptases may be obtained by 
recombinant or genetic engineering techniques that are routine and 
well-known in the art. Mutant reverse transcriptases can, for example, be 
obtained by mutating the gene or genes encoding the reverse transcriptase of 
interest by site-directed or random mutagenesis. Such mutations may include 
point mutations, deletion mutations and insertional mutations. Preferably, one 
or more point mutations {e.g., substitution of one or more amino acids with 
one or more different amino acids) are used to construct mutant reverse 
transcriptases of the invention. Fragments of reverse transcriptases may be 
obtained by deletion mutation by recombinant techniques that are routine and 
well-known in the art, or by enzymatic digestion of the reverse transcriptase(s) 
of interest using any of a number of well-known proteolytic enzymes. 
| Preferred enzymes for use in the invention include those that are 

reduced or substantially reduced in RNase H activity. Such enzymes that are 
reduced or substantially reduced in RNase H activity may be obtained by 
mutating the RNase H domain within the reverse transcriptase of interest, 
preferably by one or more point mutations, one or more deletion mutations, 
and/or one or more insertion mutations as described above. By an enzyme 
"substantially reduced in RNase H activity" is meant that the enzyme has less 
than about 30%, less than about 25%, 20%, more preferably less than about 
15%>, less than about 10%, less than about 7.5%, or less than about 5%, and 
most preferably less than about 5% or less than about 2%, or which lacks the 
RNase H activity of the corresponding wildtype or RNase H + enzyme such as 
wildtype Moloney Murine Leukemia Virus (M-MLV), Avian Myeloblastosis 
Virus (AMV) or Rous Sarcoma Virus (RSV) reverse transcriptases. RTs 
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having reduced or substantially reduced RNase H activity have been 
previously described (see U.S. Patent 5,244,797; 5,405,776; 5,668,005 and 
6,063,608; and WO 98/47912). The RNase H activity of any enzyme may be 
determined by a variety of assays, such as those described, for example, in 
U.S. Patent Nos. 5,244,797; 5,405,776; 5,668,005 and 6,063,608; in Kotewicz, 
MX., et al, Nucl Acids Res. 16:265 (1988); and in Gerard, G.F., et al, 
FOCUS 14(5)\9\ (1992), the disclosures of all of which are fully incorporated 
herein by reference. 

[0072] Particularly preferred mutated or modified enzymes for use in the 

invention include, but are not limited to, M-MLV H" reverse transcriptase, 
RSV H" reverse transcriptase, AMV H" reverse transcriptase, RAV H" reverse 
transcriptase, MAV H" reverse transcriptase and HIV H~ reverse transcriptase. 
It will be understood by one of ordinary skill, however, that any enzyme 
capable of producing a DNA molecule from a ribonucleic acid molecule (i.e., 
having reverse transcriptase activity) that is reduced or substantially reduced 
in RNase H activity may be equivalently used in the compositions, methods 
and kits of the invention. 

[0073] Polypeptides having reverse transcriptase activity for use in the 

invention may be isolated from their natural viral or bacterial sources 
according to standard procedures for isolating and purifying natural proteins 
that are well-known to one of ordinary skill in the art (see, e.g., Houts, G.E., et 
al, J. Virol 29:517 (1979)). In addition, the polypeptides having reverse 
transcriptase activity may be prepared by recombinant DNA techniques that 
are familiar to one of ordinary skill in the art (see, e.g., Kotewicz, ML., et al, 
Nucl Acids Res. 16:265 (1988); Soltis, D.A., and Skalka, A.M., Proc. Natl 
Acad Set USA 55:3372-3376 (1988)). 

[0074] In a preferred aspect of the invention, mutant or modified reverse 

transcriptases are made by recombinant techniques. A number of cloned 
reverse transcriptase genes are available or may be obtained using standard 
recombinant techniques (see U.S. Patent Nos. 5,244,797; 5,405,776; 5,668,005 
and 6,063,608 and WO 98/47912). 
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[0075] To clone a gene encoding a reverse transcriptase which will be 

modified in accordance with the invention, isolated DNA which contains the 
reverse transcriptase gene is used to construct a recombinant DNA library in a 
vector. Any vector, well known in the art, can be used to clone the reverse 
transcriptase of interest. However, the vector used must be compatible with 
the host in which the recombinant DNA library will be transformed. 

[0076] Prokaryotic vectors for constructing the plasmid library include 

plasmids such as those capable of replication in E. coli such as, for example, 
pBR322, ColEl, pSC101 ? pUC-vectors (pUC18, pUC19, etc.: In: Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York (1982); and Sambrook et aL> In: Molecular Cloning 
A Laboratory Manual (2d ed.) Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York (1989)). Bacillus plasmids include pC194, pC221> 
pC217, etc. Such plasmids are disclosed by Glyczan, T. In: The Molecular 
Biology Bacilli, Academic Press, York (1982), 307-329. Suitable 
Streptomyces plasmids include pIJlOl (Kendall et al, J. Bacteriol 
169:4177-4183 (1987)). Pseudomonas plasmids are reviewed by John et al, 
(Rad. Insec. Dis. 5:693-704 (1986)), and Igaki, (Jpn. J. Bacteriol 33:729-742 
(1978)). Broad-host range plasmids or cosmids, such as pCP13 (Darzins and 
Chakrabarbary, J. Bacteriol. 7JP:9-18, 1984) can also be used for the present 
invention. The preferred vectors for cloning the genes of the present invention 
are prokaryotic vectors. Preferably, pCP13 and pUC vectors are used to clone 
the genes of the present invention. 

[0077] The preferred host for cloning the reverse transcriptase genes of 

interest is a prokaryotic host. The most preferred prokaryotic host is E. coli. 
However, the desired reverse transcriptase genes of the present invention may 
be cloned in other prokaryotic hosts including, but not limited to, Escherichia, 
Bacillus, Streptomyces, Pseudomonas, Salmonella, Serratia, and Proteus. 
Bacterial hosts of particular interest include E. coli DH10B, which may be 
obtained from Life Technologies, a Division of Invitrogen Corporation 
(Rockville, MD). 



1 0078] Eukaryotic hosts for cloning and expression of the reverse transcriptase 

of interest include yeast, fungi, and mammalian cells. Expression of the 
desired reverse transcriptase in such eukaryotic cells may require the use of 
eukaryotic regulatory regions which include eukaryotic promoters. Cloning 
and expressing the reverse transcriptase gene in eukaryotic cells may be 
accomplished by well known techniques using well known eukaryotic vector 
systems. 

[0079] Once a DNA library has been constructed in a particular vector, an 

appropriate host is transformed by well known techniques. Transformed 
colonies are plated at a density of approximately 200-300 colonies per petri 
dish. For selection of reverse transcriptase, colonies are then screened for the 
expression of a reverse transcriptase as described in the Examples below. 
Briefly, overnight cultures of individual transformant colonies are assayed 
directly for RT using a labeled deoxynucleotide and analyzed for the presence 
of labeled product. If RT activity is detected, the mutant is sequenced to 
determine which amino acids maintained detectable RT activity. The gene 
encoding a reverse transcriptase of the present invention can be cloned using 
techniques known to a person in the art. 

Modifications or Mutations of Polymerases 
[0080] Preferably, the polymerase domain, i.e. fingers, palm, thumb regions, 

as defined herein, specifically those amino acids which are in the area or areas 
that controls the template, primer, or nucleotide interaction of the reverse 
transcriptase of interest is modified or mutated in such a way as to produce a 
mutated or modified reverse transcriptase having increased or enhanced 
fidelity (decreased misincorporation rate) and/or decreased TdT activity. 
Modifications or mutations may also be made in other regions in accordance 
with the invention. One or more mutations may be made in any reverse 
transcriptase in order to increase fidelity or decrease the TdT activity of the 
enzyme in accordance with the invention. Such mutations include point 
mutations, frame shift mutations, deletions and insertions. Preferably, one or 
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more point mutations, resulting in one or more amino acid substitutions, are 
used to produce reverse trancriptases having enhanced or increased fidelity or 
decreased or eliminated TdT activity. The numbering of amino acids in the 
M-MLV reverse transcriptase is based on the mature peptide, in which the N- 
terminal methionine has been proteolytically removed. In a preferred aspect 
of the invention, one or more mutations at positions equivalent or 
corresponding to position Y64, R116, D152, Q190, T197, D124, HI 26, Y133 
and V223 of M-MLV may be made to produce RTs with increased fidelity. 
Most preferably, a mutation at position T197 within the palm domain of the 
polymerase results in reverse transcriptases having increased fidelity and/or 
reduced misincorporation rate. In another aspect of the invention, one or more 
mutations at positions equivalent or corresponding to F309, T197 or Y133 
may be made to produce RTs with decreased or eliminated TdT activity. In 
this preferred aspect, amino acid substitutions are made at one or more of the 
above identified positions. Thus, the amino acids at these positions may be 
substituted with any other amino acid including Ala, Asn, Asp, Cys, Gin, Glu, 
Gly, His, lie, Leu, Lys, Met, Phe, Pro, Ser, Thr, Tip, Tyr, and Val. 

The corresponding positions of M-MLV RT identified above may be 
readily identified for other reverse transcriptases by one with skill in the art. 
Thus, given the defined region and the assays described in the present 
application, one with skill in the art can make one or a number of 
modifications which would result in increased fidelity of any reverse 
trancriptase of interest. The following table illustrates identified regions of 
interest for known reverse trancriptases. 
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Table 1 

Reverse transcriptase Sequence locations of high fidelity mutants 

M-MLV Y64, R116, K152, Q190, T197, V223, D124, 

H126, Y133 

AMV W25, R76, Kl 10, Q149, T156, M182 

RSV W25, R76, K110, Q149, T156, M182 

HIV W24, R78, Gl 12, Q151, A158, M184 

[0082] The nucleotide sequence for M-MLV (Shinnick et al Nature 293 \ 543 

(1981)), AMV (Joliot et al Virology 195: 812 (1993)), RSV (Haseltine et al 
Proc. Natl Acad. Set USA 74: 989 (1977)), and HIV (Wong-Staal et al. 
Nature 313: 277 (1985)) is known. 

[0083] The invention also relates to reverse transcriptase mutants, where the 

mutations or substitutions have been made in a recognized region of the 
reverse transcriptase enzyme. Such regions include, but are not limited to, the 
fingers, palm and/or thumb regions (or combinations therof). In a preferred 
embodiment of the invention, the mutations or substitutions are made in the 
thumb region, mutations in which have been shown to decrease the incidence 
of frame shifting. Methods for measuring the frame shifting rate are described 
in the examples. 

[0084] Amino acids that may be substituted for Tyr include Lys, Arg, His, 

Asp, Glu, Ala, Val, Leu, He, Pro, Met, Trp, Gly, Ser, Thr, Cys, Asn or Gin. 
Amino acids that may be substituted for Arg include Tyr, His, Asp, Glu, Ala, 
Val, Leu, He, Pro, Met, Trp, Gly, Ser, Thr, Cys, Phe, Asn or Gin. Amino acids 
that may be substituted for Lys include Tyr, Arg, His, Asp, Glu, Ala, Val, Leu, 
He, Pro, Met, Trp, Gly, Ser, Thr, Cys, Asn or Gin. Amino acids that may be 
substituted for Glu include Lys, Arg, His, Asp, Tyr, Ala, Val, Leu, lie, Pro, 
Met, Trp, Gly, Ser, Thr, Cys, Asn or Gin. Amino acids that may be 
substituted for Thr include Lys, Arg, His, Asp, Glu, Ala, Val, Leu, lie, Pro, 
Met, Trp, Gly, Ser, Tyr, Cys, Asn or Gin. Amino acids that may be 
substituted for Val or include Lys, Arg, His, Asp, Glu, Ala, Tyr, Leu, lie, Pro, 
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Met, Trp, Gly, Ser, Thr, Cys, Asn or Gin. Such mutants may be prepared by 
well known methods of site directed mutagenesis and as described herein. 
[0085] Preferably, oligonucleotide directed mutagenesis is used to create the 

mutant reverse transcriptases which allows for all possible classes of base pair 
changes at any determined site along the encoding DNA molecule. In general, 
this technique involves annealing an oligonucleotide complementary (except 
for one or more mismatches) to a single stranded nucleotide sequence coding 
for the reverse trancriptase of interest. The mismatched oligonucleotide is 
then extended by DNA polymerase, generating a double stranded DNA 
molecule which contains the desired change in sequence on one strand. The 
changes in sequence can of course result in the deletion, substitution, or 
insertion of an amino acid. The double stranded polynucleotide can then be 
inserted into an appropriate expression vector, and a mutant polypeptide can 
thus be produced. The above-described oligonucleotide directed mutagenesis 
can of course be carried out via PCR. 

Enhancing Expression of Reverse Transcriptases 

[0086] To optimize expression of the reverse transcriptases of the present 

invention, inducible or constitutive promoters are well known and may be 
used to express high levels of a reverse transcriptase structural gene in a 
recombinant host. Similarly, high copy number vectors, well known in the art, 
may be used to achieve high levels of expression. Vectors having an inducible 
high copy number may also be useful to enhance expression of the reverse 
transcriptases of the invention in a recombinant host. 

[0087] To express the desired structural gene in a prokaryotic cell (such as, E. 

coli, B. subtilis, Pseudomonas, etc.), it is necessary to operably link the desired 
structural gene to a functional prokaryotic promoter. However, the natural 
promoter of the reverse transcriptase gene may function in prokaryotic hosts 
allowing expression of the reverse transcriptase gene. Thus, the natural 
promoter or other promoters may be used to express the reverse trancriptase 
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gene. Such other promoters that may be used to enhance expression include 
constitutive or regulatable (i.e., inducible or derepressible) promoters. 
Examples of constitutive promoters include the int promoter of 
bacteriophage 1, and the bla promoter of the ^-lactamase gene of pBR322. 
Examples of inducible prokaryotic promoters include the major right and left 
promoters of bacteriophage 1 (Pr and Pl), trp, recA, lacL, lac\ tet, gal trc, 
and tac promoters of E. coli. The B. subtilis promoters include a-amylase 
(Ulmanen et al, J. Bacteriol 162:176-182 (1985)) and Bacillus bacteriophage 
promoters (Gryczan, T., In: The Molecular Biology Of Bacilli, Academic 
Press, New York (1982)). Streptomyces promoters are described by Ward et 
al, Mol Gen. Genet, 205:468478 (1986)). Prokaryotic promoters are also 
reviewed by Glick, J. Ind Microbiol. 1:277-282 (1987); Cenatiempto, Y., 
Biochimie 68:505-516 (1986); and Gottesman, Ann, Rev. Genet 75:415-442 
(1984). Expression in a prokaryotic cell also requires the presence of a 
ribosomal binding site upstream of the gene-encoding sequence. Such 
ribosomal binding sites are disclosed, for example, by Gold et al, Ann. Rev. 
Microbiol 35:365404 (1981). 

To enhance the expression of polymerases of the invention in a 
eukaryotic cell, well known eukaryotic promoters and hosts may be used. 
Preferably, however, enhanced expression of the polymerases is accomplished in 
a prokaryotic host. The preferred prokaryotic host for overexpressing this 
enzyme is E, coli. 

Isolation and Purification of Reverse Transcriptases 

The enzyme(s) of the present invention is preferably produced by 
fermentation of the recombinant host containing and expressing the desired 
reverse transcriptase gene. However, the reverse transcriptase of the present 
invention may be isolated from any strain which produces the reverse 
transcriptase of the present invention. Fragments of the reverse transcriptase 
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are also included in the present invention. Such fragments include proteolytic 
fragments and fragments having reverse transcriptase activity. 
[0090] Any nutrient that can be assimilated by a host containing the cloned 

reverse transcriptase gene may be added to the culture medium. Optimal 
culture conditions should be selected case by case according to the strain used 
and the composition of the culture medium. Antibiotics may also be added to 
the growth media to insure maintenance of vector DNA containing the desired 
gene to be expressed. Media formulations have been described in DSM or 
ATCC Catalogs and Sambrook et aL, In: Molecular Cloning, a Laboratory 
Manual (2nd ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY (1989). 

[0091] Recombinant host cells producing the reverse transcriptases of this 

invention can be separated from liquid culture, for example, by centrifugation. 
In general, the collected microbial cells are dispersed in a suitable buffer, and 
then broken down by ultrasonic treatment or by other well known procedures 
to allow extraction of the enzymes by the buffer solution. After removal of 
cell debris by ultracentrifugation or centrifugation, the reverse transcriptases 
can be purified by standard protein purification techniques such as extraction, 
precipitation, chromatography, affinity chromatography, electrophoresis or the 
like. Assays to detect the presence of the reverse transcriptase during 
purification are well known in the art and can be used during conventional 
biochemical purification methods to determine the presence of these enzymes. 

[0092] The reverse transcriptases of the invention preferably have specific 

DNA polymerase activities greater than about 5 units/mg, more preferably 
greater than about 50 units/mg, still more preferably greater than about 100 
units/mg, 250 units/mg, 500 units/mg, 1000 units/mg, 5000 units/mg or 10,000 
units/mg, and most preferably greater than about 15,000 units/mg, greater than 
about 16,000 units/mg, greater than about 17,000 units/mg, greater than about 
18,000 units/mg, greater than about 19,000 units/mg and greater than about 
20,000 units/mg. Preferred ranges of specific activities for the RTs of the 
invention include a specific activity from about 5 units/mg to about 140,000 



units/mg, a specific activity from about 5 units/mg to about 125,000 units/mg, 
a specific activity of from about 50 units/mg to about 100,000 units/mg, a 
specific activity from about 100 units/mg to about 100,000 units/mg, a specific 
activity from about 250 units/mg to about 100,000 units/mg, a specific activity 
from about 500 units/mg to about 100,000 units/mg, a specific activity from 
about 1000 units/mg to about 100,000 units/mg, a specific activity from about 
5000 units/mg to about 100,000 units/mg, a specific activity from about 
10,000 units/mg to about 100,000 units/mg, a specific activity from about 
25,000 units/mg to about 75,000 units/mg. Other preferred ranges of specific 
activities include a specific activity of from about 20,000 units/mg to about 
140,000 units/mg, a specific activity from about 20,000 units/mg to about 
130,000 units/mg, a specific activity from about 20,000 units/mg to about 
120,000 units/mg, a specific activity from about 20,000 units/mg to about 
110,000 units/mg, a specific activity from about 20,000 units/mg to about 
100,000 units/mg, a specific activity from about 20,000 units/mg to about 
90,000 units/mg, a specific activity from about 25,000 units/mg to about 
140,000 units/mg, a specific activity from about 25,000 units/mg to about 
130,000 units/mg, a specific activity from about 25,000 units/mg to about 
120,000 units/mg, a specific activity from about 25,000 units/mg to about 
110,000 units/mg, a specific activity from about 25,000 units/mg to about 
100,000 units/mg, and a specific activity from about 25,000 units/mg to about 
90,000 units/mg. Preferably, the lower end of the specific activity range may 
vary from 30,000, 35,000, 40,000, 45,000, 50,000, 5,000, 60,000, 65,000, 
70,000, 75,000, and 80,000 units/mg, while the upper end of the range may 
vary from 150,000, 140,000, 130,000, 120,000, 110,000, 100,000, and 90,000 
units/mg. In accordance with the invention, specific activity is a measurement 
of the enzymatic activity (in units) of the protein or enzyme relative to the 
total amount of protein or enzyme used in a reaction. The measurement of a 
specific activity may be determined by standard techniques well-known to one 
of ordinary skill in the art. 
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[0093] The RTs of the invention may be used to make nucleic acid molecules 

from one or more templates. Such methods comprise mixing one or more 
nucleic acid templates (e.g., mRNA, and more preferably a population of 
mRNA molecules) with one or more of the RTs of the invention and 
incubating the mixture under conditions sufficient to make one or more 
nucleic acid molecules complementary to all or a portion of the one or more 
nucleic acid templates. 

[0094] The invention also relates to methods for the amplification of one or 

more nucleic acid molecules comprising mixing one or more nucleic acid 
templates with one of the RTs of the invention, and incubating the mixture 
under conditions sufficient to amplify one or more nucleic acid molecules 
complementary to all or a portion of the one or more nucleic acid templates. 

[0095] The invention also concerns methods for the sequencing of one or 

more nucleic acid molecules comprising (a) mixing one or more nucleic acid 
molecules to be sequenced with one or more primer nucleic acid molecules, 
one or more RTs of the invention, one or more nucleotides and one or more 
terminating agents; (b) incubating the mixture under conditions sufficient to 
synthesize a population of nucleic acid molecules complementary to all or a 
portion of the one or more nucleic acid molecules to be sequenced; and (c) 
separating the population of nucleic acid molecules to determine the 
nucleotide sequence of all or a portion of the one or more nucleic acid 
molecules to be sequenced. 

[0096] The invention also concerns nucleic acid molecules produced by such 

methods (which may be full-length cDNA molecules), vectors (particularly 
expression vectors) comprising these nucleic acid molecules and host cells 
comprising these vectors and nucleic acid molecules. 

Sources of DNA Polymerase 

[0097] A variety of DNA polymerases are useful in accordance with the 

present invention. Such polymerases include, but are not limited to, Thermus 
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thermophilus (Tth) DNA polymerase, Thermus aquaticus (Taq) DNA 
polymerase, Thermotoga neapolitana (Tne) DNA polymerase, Thermotoga 
maritima (Tma) DNA polymerase, Thermococcus litoralis (Tli or VENT™) 
DNA polymerase, Pyrococcus furiosis (Pfu) DNA polymerase, DEEPVENT™ 
DNA polymerase, Pyrococcus woosii (Pwo) DNA polymerase, Bacillus 
sterothermophilus (Bsf) DNA polymerase, Bacillus caldophilus (Bca) DNA 
polymerase, Sulfolobus acidocaldarius (Sac) DNA polymerase, 
Thermoplasma acidophilum (Tad) DNA polymerase, Thermus flavus (Tfl/Tub) 
DNA polymerase, Thermus ruber (Tru) DNA polymerase, Thermus 
brockianus (DYNAZYME™) DNA polymerase, Methanobacterium 
thermoautotrophicum (Mth) DNA polymerase, Mycobacterium spp. DNA 
polymerase (Mtb, Mlep), and mutants, variants and derivatives thereof. 
[0098] DNA polymerases used in accordance with the invention may be any 

enzyme that can synthesize a DNA molecule from a nucleic acid template, 
typically in the 5' to 3' direction. Such polymerases may be mesophilic or 
thermophilic, but are preferably thermophilic. Mesophilic polymerases include 
T5 DNA polymerase, T7 DNA polymerase, Klenow fragment DNA polymerase, 
DNA polymerase III, and the like. Preferred DNA polymerases are thermostable 
DNA polymerases such as Taq, Tne, Tma, Pfu, VENT™, DEEPVENT™, Tth 
and mutants, variants and derivatives thereof (U.S. Patent No. 5,436,149; U.S. 
Patent No. 5,512,462; WO 92/06188; WO 92/06200; WO 96/10640; Barnes, 
W.M., Gene 772:29-35 (1992); Lawyer, F.C., etal, PGR Meth Appl 2:275-287 
(1993); Flaman, J.-M., et aL, Nucl Acids Res. 22(15) :3259-3260 (1994)). For 
amplification of long nucleic acid molecules (e.g., nucleic acid molecules longer 
than about 3-5 Kb in length), at least two DNA polymerases (one substantially 
lacking 3' exonuclease activity and the other having 3' exonuclease activity) are 
typically used. See U.S. Patent No. 5,436,149; U.S. Patent No. 5,512,462; 
Barnes, W.M., Gene 772:29-35 (1992); and commonly owned, co-pending U.S. 
Patent Application No. 09/741,664, filed December 21, 2000, and corresponding 
European Application 0942917, the disclosures of all of which are incorporated 
herein in their entireties. Examples of DNA polymerases substantially lacking in 
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y exonuclease activity include, but are not limited to, Tag, Tne(exo~), Tma, 
P/w(exo~), Pwo and Tth DNA polymerases, and mutants, variants and derivatives 
thereof. Nonlimiting examples of DNA polymerases having 3 T exonuclease 
activity include 7^/DEEPVENT™ and T///VENT™ and mutants, variants and 
derivatives thereof. 

Formulation of Enzyme Compositions 

[0100] To form the compositions of the present invention, one or more reverse 

transcriptases are preferably admixed in a buffered salt solution. One or more 
DNA polymerases and/or one or more nucleotides, and/or one or more primers 
may optionally be added to make the compositions of the invention. More 
preferably, the enzymes are provided at working concentrations in stable 
buffered salt solutions. The terms "stable" and "stability" as used herein 
generally mean the retention by a composition, such as an enzyme 
composition, of at least 70%, preferably at least 80%, and most preferably at 
least 90% ? of the original enzymatic activity (in units) after the enzyme or 
composition containing the enzyme has been stored for about one week at a 
temperature of about 4°C, about two to six months at a temperature of about 
-20°C, and about six months or longer at a temperature of about -80°C. As 
used herein, the term "working concentration" means the concentration of an 
enzyme that is at or near the optimal concentration used in a solution to 
perform a particular function (such as reverse transcription of nucleic acids). 

[0101] The water used in forming the compositions of the present invention is 

preferably distilled, deionized and sterile filtered (through a 0.1-0.2 
micrometer filter), and is free of contamination by DNase and RNase 
enzymes. Such water is available commercially, for example from Sigma 
Chemical Company (Saint Louis, Missouri), or may be made as needed 
according to methods well known to those skilled in the art. 

[0102] In addition to the enzyme components, the present compositions 

preferably comprise one or more buffers and cofactors necessary for synthesis 
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of a nucleic acid molecule such as a cDNA molecule. Particularly preferred 
buffers for use in forming the present compositions are the acetate, sulfate, 
hydrochloride, phosphate or free acid forms of 
Tris-(hydroxymethyl)aminomethane (TRIS®), although alternative buffers of 
the same approximate ionic strength and pKa as TRIS® may be used with 
equivalent results. In addition to the buffer salts, cofactor salts such as those 
of potassium (preferably potassium chloride or potassium acetate) and 
magnesium (preferably magnesium chloride or magnesium acetate) are 
included in the compositions. Addition of one or more carbohydrates and/or 
sugars to the compositions and/or synthesis reaction mixtures may also be 
advantageous, to support enhanced stability of the compositions and/or 
reaction mixtures upon storage. Preferred such carbohydrates or sugars for 
inclusion in the compositions and/or synthesis reaction mixtures of the 
invention include, but are not limited to, sucrose, trehalose, and the like. 
Furthermore, such carbohydrates and/or sugars may be added to the storage 
buffers for the enzymes used in the production of the enzyme compositions 
and kits of the invention. Such carbohydrates and/or sugars are commercially 
available from a number of sources, including Sigma (St. Louis, MO). 

It is often preferable to first dissolve the buffer salts, cofactor salts and 
carbohydrates or sugars at working concentrations in water and to adjust the 
pH of the solution prior to addition of the enzymes. In this way, the 
pH-sensitive enzymes will be less subject to acid- or alkaline-mediated 
inactivation during formulation of the present compositions. 

To formulate the buffered salts solution, a buffer salt which is 
preferably a salt of Tris(hydroxymethyl)aminomethane (TRIS®), and most 
preferably the hydrochloride salt thereof, is combined with a sufficient 
quantity of water to yield a solution having a TRIS® concentration of 5-150 
millimolar, preferably 10-60 millimolar, and most preferably about 20-60 
millimolar. To this solution, a salt of magnesium (preferably either the 
chloride or acetate salt thereof) may be added to provide a working 
concentration thereof of 1-10 millimolar, preferably 1.5-8.0 millimolar, and 
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most preferably about 3-7.5 millimolar. A salt of potassium (preferably a 
chloride or acetate salt of potassium) may also be added to the solution, at a 
working concentration of 10-100 millimolar and most preferably about 75 
millimolar. A reducing agent such as dithiothreitol may be added to the 
solution, preferably at a final concentration of about 1-100 mM, more 
preferably a concentration of about 5-50 mM or about 7.5-20 mM, and most 
preferably at a concentration of about 10 mM. Preferred concentrations of 
carbohydrates and/or sugars for inclusion in the compositions of the invention 
range from about 5% (w/v) to about 30% (w/v), about 7.5% (w/v) to about 
25% (w/v), about 10% (w/v) to about 25% (w/v), about 10% (w/v) to about 
20% (w/v), and preferably about 10% (w/v) to about 15% (w/v). A small 
amount of a salt of ethylenediaminetetraacetate (EDTA), such as disodium 
EDTA, may also be added (preferably about 0.1 millimolar), although 
inclusion of EDTA does not appear to be essential to the function or stability 
of the compositions of the present invention. After addition of all buffers and 
salts, this buffered salt solution is mixed well until all salts are dissolved, and 
the pH is adjusted using methods known in the art to a pH value of 7.4 to 9.2, 
preferably 8.0 to 9.0, and most preferably about 8.4. 
[0105] To these buffered salt solutions, the enzymes (reverse transcriptases 

and/or DNA polymerases) are added to produce the compositions of the 
present invention. M-MLV RTs are preferably added at a working 
concentration in the solution of about 1,000 to about 50,000 units per 
milliliter, about 2,000 to about 30,000 units per milliliter, about 2,500 to about 
25,000 units per milliliter, about 3,000 to about 22,500 units per milliliter, 
about 4,000 to about 20,000 units per milliliter, and most preferably at a 
working concentration of about 5,000 to about 20,000 units per milliliter. 
AMV RTs, RSV RTs and HIV RTs, including those of the invention described 
above, are preferably added at a working concentration in the solution of about 
100 to about 5000 units per milliliter, about 125 to about 4000 units per 
milliliter, about 150 to about 3000 units per milliliter, about 200 to about 2500 
units per milliliter, about 225 to about 2000 units per milliliter, and most 
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preferably at a working concentration of about 250 to about 1000 units per 
milliliter. The enzymes in the thermophilic DNA polymerase group (Tag, Tne, 
Tma, Pfu, VENT, DEEPVENT, Tth and mutants, variants and derivatives 
thereof) are preferably added at a working concentration in the solution of 
about 100 to about 1000 units per milliliter, about 125 to about 750 units per 
milliliter, about 150 to about 700 units per milliliter, about 200 to about 650 
units per milliliter, about 225 to about 550 units per milliliter, and most 
preferably at a working concentration of about 250 to about 500 units per 
milliliter. The enzymes may be added to the solution in any order, or may be 
added simultaneously. 
[0106] The compositions of the invention may further comprise one or more 

nucleotides, which are preferably deoxy nucleoside triphosphates (dNTPs) or 
dideoxynucleoside triphosphates (ddNTPs). The dNTP components of the 
present compositions serve as the "building blocks" for newly synthesized 
nucleic acids, being incorporated therein by the action of the polymerases, and 
the ddNTPs may be used in sequencing methods according to the invention. 
Examples of nucleotides suitable for use in the present compositions include, 
but are not limited to, dUTP, dATP, dTTP, dCTP, dGTP, dITP, 
7-deaza-dGTP, a-thio-dATP 5 a-thio-dTTP, a-thio-dGTP, a-thio-dCTP, 
ddUTP, ddATP, ddTTP, ddCTP, ddGTP, ddlTP, 7-deaza-ddGTP, 
oc-thio-ddATP, oc-thio-ddTTP, cc-thio-ddGTP, a-thio-ddCTP or derivatives 
thereof, all of which are available commercially from sources including Life 
Technologies, a Division of Invitrogen Corporation (Rockville, Maryland), 
New England BioLabs (Beverly, Massachusetts) and Sigma Chemical 
Company (Saint Louis, Missouri). The nucleotides may be unlabeled, or they 
may be detectably labeled by coupling them by methods known in the ait with 
radioisotopes (e.g., 3 H, 14 C, o2 V or 35 S), vitamins (e.g., biotin), fluorescent 
moieties (e.g., fluorescein, rhodamine, Texas Red, or phycoerythrin), 
chemiluminescent labels (e.g., using the PHOTO-GENE™ or ACES™ 
chemiluminescence systems, available commercially from Life Technologies, 
a Division of Invitrogen Corporation, Rockville, Maryland), dioxigenin and 
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the like. Labeled nucleotides may also be obtained commercially, for example 
from Life Technologies, a Division of Invitrogen Corporation (Rockville, 
Maryland) or Sigma Chemical Company (Saint Louis, Missouri). In the 
present compositions, the nucleotides are added to give a working 
concentration of each nucleotide of about 10-4000 micromolar, about 50-2000 
micromolar, about 100-1500 micromolar, or about 200-1200 micromolar, and 
most preferably a concentration of about 1000 micromolar. 
[0107] To reduce component deterioration, storage of the reagent 

compositions is preferably at about 4°C for up to one day, or most preferably 
at -20°C for up to one year. 
[0108] In another aspect, the compositions and reverse transcriptases of the 

invention may be prepared and stored in dry form in the presence of one or 
more carbohydrates, sugars, or synthetic polymers. Preferred carbohydrates, 
sugars or polymers for the preparation of dried compositions or reverse 
transcriptases include, but are not limited to, sucrose, trehalose, and 
polyvinylpyrrolidone (PVP) or combinations thereof. See, e.g., U.S. Patent 
Nos. 5,098,893, 4,891,319, and 5,556,771, the disclosures of which are 
entirely incorporated herein by reference. Such dried compositions and 
enzymes may be stored at various temperatures for extended times without 
significant deterioration of enzymes or components of the compositions of the 
invention. Preferably, the dried reverse transcriptases or compositions are 
stored at 4°C or at -20°C. 

Production of cDNA Molecules 

Sources of Nucleic Acid Molecules 

[0109] In accordance with the invention, cDNA molecules (single-stranded or 

double-stranded) may be prepared from a variety of nucleic acid template 
molecules. Preferred nucleic acid molecules for use in the present invention 
include single- stranded or double- stranded DNA and RNA molecules, as well 
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as double-stranded DNA:RNA hybrids. More preferred nucleic acid 
molecules include messenger RNA (mRNA), transfer RNA (tRNA) and 
ribosomal RNA (rRNA) molecules, although mRNA molecules are the 
preferred template according to the invention. 

[0110] The nucleic acid molecules that are used to prepare cDNA molecules 

according to the methods of the present invention may be prepared 
synthetically according to standard organic chemical synthesis methods that 
will be familiar to one of ordinary skill. More preferably, the nucleic acid 
molecules may be obtained from natural sources, such as a variety of cells, 
tissues, organs or organisms. Cells that may be used as sources of nucleic acid 
molecules may be prokaryotic (bacterial cells, including but not limited to 
those of species of the genera Escherichia, Bacillus, Serratia, Salmonella, 
Staphylococcus, Streptococcus, Clostridium, Chlamydia, Neisseria, 
Treponema, Mycoplasma, Borrelia, Legionella, Pseudomonas, 
Mycobacterium, Helicobacter, Erwinia, Agrobacterium, Rhizobium, 
Xanthomonas and Streptomyces) or eukaryotic (including fungi (especially 
yeasts), plants, protozoans and other parasites, and animals including insects 
(particularly Drosophila spp. cells), nematodes (particularly Caenorhabditis 
elegans cells), and mammals (particularly human cells)). 

[0111] Mammalian somatic cells that may be used as sources of nucleic acids 

include blood cells (reticulocytes and leukocytes), endothelial cells, epithelial 
cells, neuronal cells (from the central or peripheral nervous systems), muscle 
cells (including myocytes and myoblasts from skeletal, smooth or cardiac 
muscle), connective tissue cells (including fibroblasts, adipocytes, 
chondrocytes, chondroblasts, osteocytes and osteoblasts) and other stromal 
cells (e.g., macrophages, dendritic cells, Schwann cells). Mammalian germ 
cells (spermatocytes and oocytes) may also be used as sources of nucleic acids 
for use in the invention, as may the progenitors, precursors and stem cells that 
give rise to the above somatic and germ cells. Also suitable for use as nucleic 
acid sources are mammalian tissues or organs such as those derived from 
brain, kidney, liver, pancreas, blood, bone marrow, muscle, nervous, skin, 
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genitourinary, circulatory, lymphoid, gastrointestinal and connective tissue 
sources, as well as those derived from a mammalian (including human) 
embryo or fetus. 

Any of the above prokaryotic or eukaryotic cells, tissues and organs 
may be normal, diseased, transformed, established, progenitors, precursors, 
fetal or embryonic. Diseased cells may, for example, include those involved 
in infectious diseases (caused by bacteria, fungi or yeast, viruses (including 
AIDS, HIV, HTLV, herpes, hepatitis and the like) or parasites), in genetic or 
biochemical pathologies (e.g., cystic fibrosis, hemophilia, Alzheimer's disease, 
muscular dystrophy or multiple sclerosis) or in cancerous processes. 
Transformed or established animal cell lines may include, for example, COS 
cells, CHO cells, VERO cells, BHK cells, HeLa cells, HepG2 cells, K562 
cells, 293 cells, L929 cells, F9 cells, and the like. Other cells, cell lines, 
tissues, organs and organisms suitable as sources of nucleic acids for use in the 
present invention will be apparent to one of ordinary skill in the art. 
| Once the starting cells, tissues, organs or other samples axe obtained, 

nucleic acid molecules (such as mRNA) may be isolated therefrom by 
methods that are well-known in the art (See, e.g., Maniatis, T., et al, Cell 
75:687-701 (1978); Okayama, H., and Berg, P., Mol. Cell. Biol 2:161-170 
(1982); Gubler, U., and Hoffman, B J., Gene 25:263-269 (1983)). The nucleic 
acid molecules thus isolated may then be used to prepare cDNA molecules and 
cDNA libraries in accordance with the present invention. 
] In the practice of the invention, cDNA molecules or cDNA libraries 

are produced by mixing one or more nucleic acid molecules obtained as 
described above, which is preferably one or more mRNA molecules such as a 
population of mRNA molecules, with a polypeptide having reverse 
transcriptase activity of the present invention, or with one or more of the 
compositions of the invention, under conditions favoring the reverse 
transcription of the nucleic acid molecule by the action of the enzymes or the 
compositions to form a cDNA molecule (single- stranded or double- stranded). 
Thus, the method of the invention comprises (a) mixing one or more nucleic 
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acid templates (preferably one or more RNA or mRNA templates, such as a 
population of mRNA molecules) with one or more reverse transcriptases of 
the invention and (b) incubating the mixture under conditions sufficient to 
make one or more nucleic acid molecules complementary to all or a portion of 
the one or more templates. Such methods may include the use of one or more 
DNA polymerases, one or more nucleotides, one or more primers, one or more 
buffers, and the like. The invention may be used in conjunction with methods 
of cDNA synthesis such as those described in the Examples below, or others 
that are well-known in the art (see, e.g., Gubler, U., and Hoffman, B.J., Gene 
25:263-269 (1983); Krug, M.S., and Berger, S.L., Meth Enzymol 
752:316-325 (1987); Sambrook, J., et al, Molecular Cloning: A Laboratory 
Manual, 2nd ed., Cold Spring Harbor, NY: Cold Spring Harbor Laboratory 
Press, pp. 8.60-8.63 (1989); WO 99/15702; WO 98/47912; and WO 
98/51699), to produce cDNA molecules or libraries. 

[0115] Other methods of cDNA synthesis which may advantageously use the 

present invention will be readily apparent to one of ordinary skill in the art. 

[0116] Having obtained cDNA molecules or libraries according to the present 

methods, these cDNAs may be isolated for further analysis or manipulation. 
Detailed methodologies for purification of cDNAs are taught in the 
GENETRAPPER™ manual (Life Technologies, a Division of Invitrogen 
Corporation, Rockville, Maryland), which is incorporated herein by reference 
in its entirety, although alternative standard techniques of cDNA isolation that 
are known in the art (see, e.g., Sambrook, J., et al, Molecular Cloning: A 
Laboratory Manual, 2nd ed. ? Cold Spring Harbor, NY: Cold Spring Harbor 
Laboratory Press, pp. 8.60-8.63 (1989)) may also be used. 

[0117] In other aspects of the invention, the invention may be used in methods 

for amplifying and sequencing nucleic acid molecules. Nucleic acid 
amplification methods according to this aspect of the invention may be one- 
step (e.g., one-step RT-PCR) or two-step (e.g., two-step RT-PCR) reactions. 
According to the invention, one-step RT-PCR type reactions may be 
accomplished in one tube thereby lowering the possibility of contamination. 
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Such one-step reactions comprise (a) mixing a nucleic acid template (e.g., 
mRNA) with one or more reverse transcriptases of the present invention and 
with one or more DNA polymerases and (b) incubating the mixture under 
conditions sufficient to amplify a nucleic acid molecule complementary to all 
or a portion of the template. Such amplification may be accomplished by the 
reverse transcriptase activity alone or in combination with the DNA 
polymerase activity. Two-step RT-PCR reactions may be accomplished in 
two separate steps. Such a method comprises (a) mixing a nucleic acid 
template (e.g., mRNA) with a reverse transcriptase of the present invention, 
(b) incubating the mixture under conditions sufficient to make a nucleic acid 
molecule (e.g., a DNA molecule) complementary to all or a portion of the 
template, (c) mixing the nucleic acid molecule with one or more DNA 
polymerases and (d) incubating the mixture of step (c) under conditions 
sufficient to amplify the nucleic acid molecule. For amplification of long 
nucleic acid molecules (i.e., greater than about 3-5 Kb in length), a 
combination of DNA polymerases may be used, such as one DNA polymerase 
having 3' exonuclease activity and another DNA polymerase being 
substantially reduced in 3' exonuclease activity. 

Nucleic acid sequencing methods according to this aspect of the 
invention may comprise both cycle sequencing (sequencing in combination 
with amplification) and standard sequencing reactions. The sequencing 
method of the invention thus comprises (a) mixing a nucleic acid molecule to 
be sequenced with one or more primers, one or more reverse transcriptase of 
the invention, one or more nucleotides and one or more terminating agents, (b) 
incubating the mixture under conditions sufficient to synthesize a population 
of nucleic acid molecules complementary to all or a portion of the molecule to 
be sequenced, and (c) separating the population to determine the nucleotide 
sequence of all or a portion of the molecule to be sequenced. According to the 
invention, one or more DNA polymerases (preferably thermostable DNA 
polymerases) may be used in combination with or separate from the reverse 
transcriptases of the invention. 
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Amplification methods which may be used in accordance with the 
present invention include PCR (U.S. Patent Nos. 4,683,195 and 4,683,202), 
Strand Displacement Amplification (SDA; U.S. Patent No. 5,455,166; EP 0 
684 315), and Nucleic Acid Sequence-Based Amplification (NASBA; U.S. 
Patent No. 5,409,818; EP 0 329 822), as well as more complex PCR-based 
nucleic acid fingerprinting techniques such as Random Amplified 
Polymorphic DNA (RAPD) analysis (Williams, J.G.K., et aL 9 Nuel. Acids Res. 
7S(22):6531-6535, 1990), Arbitrarily Primed PCR (AP-PCR; Welsh, J., and 
McClelland, M., NucL Acids Res. 18(24):12U-12\^ 1990), DNA 
Amplification Fingerprinting (DAF; Caetano-Anolles et al, Bio/Technology 
2:553-557, 1991), microsatellite PCR or Directed Amplification of 
Minisatellite-region DNA (DAMD; Heath, D.D., et al NucL Acids Res, 
21(24): 5782-5785 (1993), and Amplification Fragment Length Polymorphism 
(AFLP) analysis (EP 0 534 858; Vos, P., et al NucL Acids Res. 
23(21):AAQ1AA\A (1995); Lin, JJ., and Kuo, J. FOCUS 17(2):66-70 (1995). 
Nucleic acid sequencing techniques which may employ the present 
compositions include dideoxy sequencing methods such as those disclosed in 
U.S. Patent Nos. 4,962,022 and 5,498,523. In a particularly preferred aspects, 
the invention may be used in methods of amplifying or sequencing a nucleic 
acid molecule comprising one or more polymerase chain reactions (PCRs), 
such as any of the PCR-based methods described above. 

Kits 

i] In another embodiment, the present invention may be assembled into 

kits for use in reverse transcription or amplification of a nucleic acid molecule, 
or into kits for use in sequencing of a nucleic acid molecule. Kits according to 
this aspect of the invention comprise a carrier means, such as a box, carton, 
tube or the like, having in close confinement therein one or more container 
means, such as vials, tubes, ampules, bottles and the like, wherein a first 
container means contains one or more polypeptides of the present invention 
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having reverse transcriptase activity. When more than one polypeptide having 
reverse transcriptase activity is used, they may be in a single container as 
mixtures of two or more polypeptides, or in separate containers. The kits of 
the invention may also comprise (in the same or separate containers) one or 
more DNA polymerases, a suitable buffer, one or more nucleotides and/or one 
or more primers. 

In a specific aspect of the invention, the reverse transcription and 
amplification kits may comprise one or more components (in mixtures or 
separately) including one or more polypeptides having reverse transcriptase 
activity of the invention, one or more nucleotides needed for synthesis of a 
nucleic acid molecule, and/or one or more primers (e.g., oligo(dT) for reverse 
transcription). Such reverse transcription and amplification kits may further 
comprise one or more DNA polymerases. Sequencing kits of the invention 
may comprise one or more polypeptides having reverse transcriptase activity 
of the invention, and optionally one or more DNA polymerases, one or more 
terminating agents (e.g., dideoxynucleoside triphosphate molecules) needed 
for sequencing of a nucleic acid molecule, one or more nucleotides and/or one 
or more primers. Preferred polypeptides having reverse transcriptase activity, 
DNA polymerases, nucleotides, primers and other components suitable for use 
in the reverse transcription, amplification and sequencing kits of the invention 
include those described above. The kits encompassed by this aspect of the 
present invention may further comprise additional reagents and compounds 
necessary for carrying out standard nucleic acid reverse transcription, 
amplification or sequencing protocols. Such polypeptides having reverse 
transcriptase activity of the invention, DNA polymerases, nucleotides, 
primers, and additional reagents, components or compounds may be contained 
in one or more containers, and may be contained in such containers in a 
mixture of two or more of the above-noted components or may be contained in 
the kits of the invention in separate containers. 
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Use of Nucleic Acid Molecules 

0122] The nucleic acid molecules or cDNA libraries prepared by the methods 

of the present invention may be further characterized, for example by cloning 
and sequencing (i.e., determining the nucleotide sequence of the nucleic acid 
molecule), by the sequencing methods of the invention or by others that are 
standard in the art (see, e.g., U.S. Patent Nos. 4,962,022 and 5,498,523, which 
are directed to methods of DNA sequencing). Alternatively, these nucleic acid 
molecules may be used for the manufacture of various materials in industrial 
processes, such as hybridization probes by methods that are well-known in the 
art. Production of hybridization probes from cDNAs will, for example, 
provide the ability for those in the medical field to examine a patient's cells or 
tissues for the presence of a particular genetic marker such as a marker of 
cancer, of an infectious or genetic disease, or a marker of embryonic 
development. Furthermore, such hybridization probes can be used to isolate 
DNA fragments from genomic DNA or cDNA libraries prepared from a 
different cell, tissue or organism for further characterization. 
[0123] The nucleic acid molecules of the present invention may also be used 

to prepare compositions for use in recombinant DNA methodologies. 
Accordingly, the present invention relates to recombinant vectors which 
comprise the cDNA or amplified nucleic acid molecules of the present 
invention, to host cells which are genetically engineered with the recombinant 
vectors, to methods for the production of a recombinant polypeptide using 
these vectors and host cells, and to recombinant polypeptides produced using 
these methods. 

[0124] Recombinant vectors may be produced according to this aspect of the 

invention by inserting, using methods that are well-known in the art, one or 
more of the cDNA molecules or amplified nucleic acid molecules prepared 
according to the present methods into a vector. The vector used in this aspect 
of the invention may be, for example, a phage or a plasmid, and is preferably a 
plasmid. Preferred are vectors comprising czs-acting control regions to the 
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nucleic acid encoding the polypeptide of interest. Appropriate trans-acting 
factors may be supplied by the host, supplied by a complementing vector or 
supplied by the vector itself upon introduction into the host. 
[0125] In certain preferred embodiments in this regard, the vectors provide for 

specific expression (and are therefore termed "expression vectors"), which 
may be inducible and/or cell type-specific. Particularly preferred among such 
vectors are those inducible by environmental factors that are easy to 
manipulate, such as temperature and nutrient additives. 
[0126] Expression vectors useful in the present invention include 

chromosomal-, episomal- and virus-derived vectors, e.g., vectors derived from 
bacterial plasmids or bacteriophages, and vectors derived from combinations 
thereof, such as cosmids and phagemids, and will preferably include at least 
one selectable marker such as a tetracycline or ampicillin resistance gene for 
culturing in a bacterial host cell. Prior to insertion into such an expression 
vector, the cDNA or amplified nucleic acid molecules of the invention should 
be operatively linked to an appropriate promoter, such as the phage lambda P L 
promoter, the E. coli lac, trp and tac promoters. Other suitable promoters will 
be known to the skilled artisan. 
[0127] Among vectors preferred for use in the present invention include 

pQE70, pQE60 and pQE-9, available from Qiagen; pBS vectors, Phagescript 
vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A, available 
from Stratagene; pcDNA3 available from Invitrogen Corporation; pGEX, 
pTrxfus, P Trc99a, pET-5, pET-9, pKK223-3, pKK233-3, pDR540, pRIT5 
available from Pharmacia; and pSPORTl, pSPORT2 and pSV-SPORTl, 
available from Life Technologies, a Division of Invitrogen Corporation. Other 
suitable vectors will be readily apparent to the skilled artisan. 
[0128] The invention also provides methods of producing a recombinant host 

cell comprising the cDNA molecules, amplified nucleic acid molecules or 
recombinant vectors of the invention, as well as host cells produced by such 
methods. Representative host cells (prokaryotic or eukaryotic) that may be 
produced according to the invention include, but are not limited to, bacterial 
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cells, yeast cells, plant cells and animal cells. Preferred bacterial host cells 
include Escherichia coli cells (most particularly E. coli strains DH10B and 
Stbl2, which are available commercially (Life Technologies, a Division of 
Invitrogen Corporation; Rockville, Maryland)), Bacillus subtilis cells, Bacillus 
megaterium cells, Streptomyces spp. cells, Erwinia spp. cells, Klebsiella spp. 
cells and Salmonella typhimurium cells. Preferred animal host cells include 
insect cells (most particularly Spodoptera frugiperda SJ9 and Sfll cells and 
Trichoplusa High-Five cells) and mammalian cells (most particularly CHO, 
COS, VERO, BHK and human cells). Such host cells may be prepared by 
well-known transformation, electroporation or transfection techniques that will 
be familiar to one of ordinary skill in the art. 
[0129] In addition, the invention provides methods for producing a 

recombinant polypeptide, and polypeptides produced by these methods. 
According to this aspect of the invention, a recombinant polypeptide may be 
produced by culturing any of the above recombinant host cells under 
conditions favoring production of a polypeptide therefrom, and isolation of the 
polypeptide. Methods for culturing recombinant host cells, and for production 
and isolation of polypeptides therefrom, are well-known to one of ordinary 
skill in the art. 

[0130] It will be readily apparent to one of ordinary skill in the relevant arts 

that other suitable modifications and adaptations to the methods and 
applications described herein are obvious and may be made without departing 
from the scope of the invention or any embodiment thereof. Having now 
described the present invention in detail, the same will be more clearly 
understood by reference to the following examples, which are included 
herewith for purposes of illustration only and are not intended to be limiting of 
the invention. 
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EXAMPLES 

[0131] The following Materials and Methods were used in the Examples 

below. 

LacZ forward assay using an RNA template 

[0132] The assay was conducted as described in Boyer JC, et al Analyzing 

the fidelity of reverse transcription and transcription. Methods Enzymol. 275: 

523 (1996), with the following exceptions. 
[0133] Preparation of RNA template. A clone of pUC19 (homologous with 

M13mpl9 in lacZ region) with the T7 RNA promoter inserted between 

nucleotides 1 12 and 113 was used as the RNA template. 
[0134] Construction of gapped M13 substrate. M13mpl9 (Life Technologies, 

a Division of Invitrogen Corporation, Rockville, Maryland) was used instead 

ofM13mp2. 

[0135] Preparation of Competent cells. Electromax DH12S competent cells 

(Life Technologies, a Division of Invitrogen Corporation, Rockville, 
Maryland) were used instead of MCI 061. 

[0136] Mutagenesis. The mutants were made by oligo-directed 

mutagenesis as described in Kunkel, T.A. et al Methods Enzymol. 204: 125 
(1991). Briefly, the Superscript II gene (an M-MLV RT gene containing point 
mutations in the RNase H domain, see below) was inserted into pBADhisA 
(Invitrogen, Carlsbad, CA) vector and named pBAD-SS II. This plasmid was 
transformed into DH11S cells (Life Technologies, a Division of Invitrogen 
Corporation, Rockville, Maryland) and infected with M13K07 helper phage 
from which single strand DNA was isolated. Oligos were designed 
corresponding to each mutation: Y64W, R116M, K152R, Q190F, T197A, and 
V223H. 100 jjM of each oligo was 32 P-labeled with T4 polynucleotide kinase 
(Life Technologies, a Division of Invitrogen Corporation, Rockville, 
Maryland) using the Forward Rxn Buffer (Life Technologies, a Division of 
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Invitrogen Corporation, Rockville, Maryland). The oligo was annealed to the 
single strand pBAD-SS II. Native T7 DNA polymerase (USB, Cleveland, 
OH) and T4 DNA ligase (Life Technologies, a Division of Invitrogen 
Corporation, Rockville, Maryland) were added with synthesis buffer (0.4 mM 
dNTPs, 17.5 mM Tris-HCl, pH 7.5, 5 mM MgCl 2? 2.5 mM DTT, and 1 mM 
ATP) to the annealed reaction on ice. The reactions were incubated at 37°C for 
30 minutes and terminated by adding 1 |j,l of 0.5 M EDTA (Life Technologies, 
a Division of Invitrogen Corporation, Rockville, Maryland). The reactions 
were transformed and plated with DH10B cells. Colonies were picked and 
mutants were determined by restriction analysis and sequenced using an ABI 
377 and ABI Big Dye terminator cycle sequencing ready reaction kit (PE 
Applied Biosystems, Foster City, CA) for confirmation. 

Protein Purification of mutants. The cell pellet containing induced RT 
was suspended in a ratio of 2 mL Lysis buffer (40 mM Tris-HCl, pH 8.0, 0.1 
M KC1, 1 mM PMSF)/1 gram of cell pellet. The suspension was sonicated on 
ice and then centrifuged at 27000 x g for 30 min. The cell-free extract was 
filtered through a 0.45 \i syringe filter. The cell-free extract was applied to a 5- 
mL Ni2+ HI-TRAP column (Pharmacia) pre-equilibrated with 5 volumes 5 
mM imidazole in Buffer A (40 mM Tris-HCl, pH 8.0, 10% glycerol, 0.01% 
Triton X-100, 0.1 M KC1) at 1 mL/min. The column was washed with 10 
volumes 5 mM imidazole in Buffer A. The RT was eluted by washing with 20 
volumes of a gradient of 5 mM to 1 M imidazole in Buffer A. The eluate 
containing RT protein was applied to a 1-mL Mono-S column (Pharmacia) 
pre-equilabrated with 10 column volumes 50 mM KC1 in Buffer B (40 mM 
Tris-HCl, pH 8.0, 10% glycerol, 0.01% Triton X-100, 0.1 mM EDTA, 1 mM 
DTT) at a flow rate of 1.0 mL/min. The column was washed with 10 volumes 
50 mM KC1 in Buffer B. RT was eluated with 20 volumes of a gradient from 
50 mM to 1 M KC1 in Buffer B. The individual fractions were analyzed for RT 
activity. The fraction containing peak RT activity was dialyzed against 
Storage buffer (40 mM Tris-HCl, pH 8.0, 50% glycerol, 0.01% Triton X-100, 
0.1 mM EDTA, 1 mM DTT, 0.1 M KC1). The isolated proteins were more 
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than 95% pure, as judged by SDS-PAGE. The protein concentrations were 
determined by using the Biorad colorimetric kit with BSA as a standard. 

EXAMPLE 1: 
Mutation frequency of M-MLV high fidelity mutants 

[0138] Mutation frequency Data and Calculation of Error Rates. Mutation 

frequency (MF) is determined by dividing the number of mutant plaques (light 
blue or white) by the total number of plaques and then subtracting the 
background mutation frequency of the starting DNA. 

[0139] All mutant reverse transcriptases tested also contained the point 

mutations to remove RNase H activity, as in Superscript II (SS II, U.S. Patent 
Nos. 5,244,797; 5,405,776; 5,668,005 and 6,063,608). Point mutations were 
made in the M-MLV RT gene to remove RNase H activity. The point 
mutations include D524G, D583N, and E562Q. Briefly, the RT gene from 
pRT601 was inserted into a pUC plasmid and then the above point mutations 
were made in the RNase H domain of the RT gene. pRT601 is described in 
U.S. Patent Nos. 5,244,797; 5,405,776; 5,668,005 and 6,063,608 and was 
deposited at the ATCC under Accession No. 67007 (See U.S. Patent No. 
5,017,492). This RNase H" mutant is referred to herein as Superscript II or 
Superscript II gene. 
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TABLE 2: 
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[0140] The lacZcx assay employs the provided RT to copy lacZa RNA to 

cDNA. This cDNA copy, when annealed to Ml 3, transfected, and expressed, 
will have either the normal wild type phenotype of a dark blue plaque, or, if 
the RT made a mistake while copying, it will have the mutant phenotype of a 
light blue or clear plaque. The mistakes could be in the form of insertions, 
deletions or misincorporations. Any decrease from the mutation frequency of 
M-MLV RNase H" RT (Superscript II) by the mutant RTs indicate an increase 
in fidelity. As shown in TABLE 2, the selected mutants demonstrate a 1.5-2,3 
fold decrease in mutation frequency. The mutant V223F has a mutation 
frequency 2.2 fold higher than SS II and thus has lower fidelity. 

EXAMPLE 2: 
Misinsertion assays with DNA template. 

[0141] Misinsertion assay ofY64W f R116M, K152R, Q190F, T197A, V223H 

M-MLV RNase IT RT with DNA template. This assay was employed to 
compare the misincorporation capability of the mutants to Superscript II (M- 
MLV RNase H" RT). The assay is a primer extension assay using synthetic 
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DNA template-primer and biased dNTP pools containing only three of the 
four dNTPs. The reactions are displayed on a gel in Figs. 1-3. In this assay, 
higher efficiency of primer extension in the absence of one dNTP denotes 
lower fidelity. As shown in Figs. 1-3, in the presence of all 4 dNTPs, 
Superscript II and all the selected mutants were able to extend the primer 
approximately equally, with some variance in the addition of non-template 
nucleotides at the end of the primer. However when incubated with biased 
pools of nucleotides, SS II was able to catalyze substantial extension past 
template nucleotides for which a complementary dNTP was missing, 
indicating use of incorrect nucleotides and lower fidelity. In Fig.l, the V223H 
mutant (designated as lane 2) showed shorter extension products than SS II in 
each of the biased pools of three dNTPs, indicating less ability to incorporate 
incorrect nucleotides and thus higher fidelity. This corresponds with the 
results of the lacZcc assay where the V223H mutant had a lower mutation 
frequency than SS II, 21 x 10" 4 versus 44 x 10" 4 , respectively. On the other 
hand V223F (lane 3), which had a higher mutation frequency (97 x 10~ 4 ) than 
SS II (44 x 10" 4 ) in the lacZa assay, also has equal sized or longer extension 
products than SS II in each of the biased pools, indicating that it has a lower 
fidelity. These data shows a correlation between the misinsertion assay on 
DNA and the lacZa assay on RNA wherein higher fidelity mutants had both 
shorter extension products with biased pools of dNTPs and lower mutation 
frequencies in the lacZa assay. Fig. 2 and Fig. 3 show similar results for the 
mutants R116M, Q190F, K152R, T197A, and Y64W, where each had shorter 
primer extension products than SS II in the biased nucleotide pools. 

EXAMPLE 3 
TdT reverse transcriptase mutants 

[0142] In checking fidelity mutants of reverse transcriptase (RT) for 

misextension in a 3 dNTP assay, it was observed that SS II RT extended 2-3 
bases past the end of the template in the presence of 3 and 4 dNTPs. This non- 
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template directed extension or TdT activity is reduced in many mutants, but in 
a few such as F309N and T197E it appears that this activity is severely 
reduced or eliminated. These mutants are probably in close proximity or in 
contact with the template-primer as determined by homology to HIV reverse 
transcriptase and its crystal structure with bound template-primer. 

METHODS 

Mutagenesis 

For F309N: 

[0143] Primers were designed corresponding to the mutant position F309 with 

the silent insertion of a NgoMIV restriction site at amino acid positions 310- 
31 L The primers encoded a random NNK sequence for this position 
generating a random library of F309 mutants, where N is any of the four bases 
and K is T or G. The primers along with internal SS II RT primers at an 
upstream SstI restriction site and a downstream Sail restriction site were used 
in a standard PCR reaction (10 ng SS II RT template, 2 pM of each primer, 48 
jjl Supermix (Life Technologies, a division of Invitrogen Corporation) for 20 
cycles of 94°C 15 sec, 55°C 15 sec, 72°C 30sec) to generate two PCR 
fragments. These were a 240 bp Sstl-NgoMIV fragment and a 200 bp 
NgoMIV-Sall fragment. The fragments were isolated and digested and ligated 
together and then inserted into the original SS II RT clone cut with SstI and 
Sail. The resulting ligation product was transformed in Max Efficiency 
DH10B (Life Technologies, a division of Invitrogen Corporation) competent 
cells to create the library of mutants at site F309. This library was then plated 
overnight for selection. 

ForT197E and Y133A: 
[0144] The mutants T197E and Y133A were made by oligo-directed 

mutagenesis as described in Kunkel, T.A. et al Methods Enzymol 204; 125 
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(1991). Briefly, the Superscript II RT gene was inserted into pBADhisA 
(Invitrogen Corporation) vector and named pBAD-SSIL This plasmid was 
transformed into DH1 IS cells and the cells were infected with M13K07 helper 
phage from which single strand DNA was isolated. Oligos were designed 
corresponding to each mutation: T197E and Y133A. Each oligo (100 |uM) was 
kinased with T4 DNA kinase (Life Technologies, a division of Invitrogen 
Corporation) using the Forward Rxn Buffer (Life Technologies, a division of 
Invitrogen Corporation). The oligo was annealed to single stranded pBAD- 
SSII DNA. Native T7 DNA polymerase (USB) and T4 DNA ligase (Life 
Technologies, a division of Invitrogen Corporation) were added with synthesis 
buffer (0.4 raM dNTPs, 17.5 mM Tris-HCl, pH 7.5, 5 mM MgCl 2? 2.5 mM 
DTT, and 1 mM ATP) to the annealed reaction on ice. The reactions were 
incubated at 37°C for 30 minutes and terminated by adding 1 \il of 0.5 M 
EDTA. The reactions were transformed and plated with DH10B cells. 
Colonies were picked and mutants were determined by restriction enzyme 
analysis and sequenced for confirmation using an ABI 377 instrument and 
ABI Big Dye Terminator Cycle Sequencing Ready Reaction kit. 

Selecting Colonies Containing Active RT. Individual transformant 
colonies were inoculated into single wells of a 96 well culture plate. Each well 
contained 120 jal of media (EG-Ap) containing 0.2% arabinose. It is preferable 
to first inoculate a 96 well plate with selective medium without the inducer, to 
grow that master plate overnight, and then to make a replica of the master 
plate into a 96-well plate with the inducer and grow that plate overnight. The 
cultures were grown overnight at 37°C without shaking. Overnight cultures 
were mixed with an equal volume of 2X PLD (1.8% glucose, 50 mM Tris- 
HC1, pH 8.0, 20 mM EDTA, 20 mM DTT, 1% Triton X-100, 2 mg/mL 
lysozyme) at room temperature. These extracts were assayed directly for RT 
activity by mixing 10 ]ul of the extract with 40 jui of 1.25X RT reaction mix 
(62.5 mM Tris-HCl, pH 8.4, 62.5 mM KC1, 12.5 mM MgCl 2; 12.5 mM DTT, 
1.25 mM dGTP, polyC/ oligo dG (3.75 mM/1.5 mM in nucleotide), [ 32 P] 
dGTP). This reaction was placed in a 37°C water bath for 10 min. A small 
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aliquot of the reaction mixture (5 \xl) was spotted onto a charged nylon 
membrane (Genescreen+, NEN). The membrane was washed twice with 
10%TCA + 1% sodium pyrophosphate, rinsed with ethanol, dried, and placed 
next to a phosphor screen. Radioactive product that had been trapped on the 
filter was detected by analyzing the screen in a Phosphorimager, using 
ImageQuant software (Molecular Devices). Candidates were selected if they 
showed RT activity (radioactivity). These candidates were screened a second 
time to confirm the phenotype. The confirmed candidates were then sequenced 
to determine which amino acids maintained detectable RT activity. 

Purification of RT Mutants. 

The cell pellet containing induced RT was suspended in a ratio of 2 
mL Lysis buffer (40mM Tris-HCl, pH 8.0, 0.1 M KC1, 1 mM PMSF)/ 1 gram 
of cell pellet. The suspension was sonicated on ice and then centrifuged at 
27,000g for 30 min. The cell-free extract was filtered through a 0.45 ja syringe 
filter. The cell-free extract was applied to a 5 mL Ni 2+ HI-TRAP column 
(Pharmacia) pre-equilibrated with 5 volumes 5 mM imidazole in buffer A (40 
mM Tris HC1, pH 8.0, 10% glycerol, 0.01% Triton X-100, 0.1 M KC1) at 1 
mL/min. The column was washed with 10 volumes 5 mM imidazole in buffer 
A. The RT was eluted by washing with 20 volumes of a gradient of 5 mM to 
1M imidazole in buffer A. The eluate containing RT protein was applied to a 
1 mL Mono-S column (Pharmacia) pre-equilabrated with 10 column volumes 
50 mM KC1 in buffer B (40 mM Tris-HCl, pH 8.0, 10% glycerol, 0.01% 
Triton X-100, 0.1 mM EDTA, 1 mM DTT) at a flow rate of 1.0 mL/min. The 
column was washed with 10 volumes of 50mM KC1 in buffer B. RT was 
eluted with 20 volumes of a gradient from 50 mM to 1 M KC1 in buffer B. The 
individual fractions were analyzed for RT activity. The fraction containing 
peak RT activity was dialyzed against storage buffer (40 mM Tris-HCl, pH 
8.0, 50% glycerol, 0.01% Triton X-100, 0.1 mM EDTA, 1 mM DTT, 0.1 M 
KC1). The purified RTs were more than 95% pure, as judged by SDS-PAGE. 
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The protein concentrations were determined by using the Biorad colorimetric 
kit. 

[0147] 3 dNTP Assay Method. Procedures were modified from those of 

Preston, B.D., et al Science 242:1 168 (1988). The DNA template-primer was 
prepared by annealing a 47-mer template (5'- 
GAGTTACAGTGTTTTTGTTCCAGTCTGTAGCAGTGTGTGAATGGAA 
G-3') (SEQ ID NO: 1) to an 18-mer primer (5'- 

CTTCCATTCACACACTGC-3') (SEQ ID NO: 2) [ 32 P]-labeled at the 5'-end 
with T4 polynucleotide kinase (template :primer, 3:1). Assay mixture (10 jil) 
contained 5 nM template-primer, 50-200 nM RT as specified in figure 
legends, 3 or 4 dNTPs (250 jiM each), 50 mM Tris-HCl (pH 8.3), 75 mM 
KC1, 3 mM MgCl 2? 10 mM DTT. Reactions were incubated at 37°C for 30 min 
and terminated by the addition of 5 |al of 40 mM EDTA, 99% formamide. 
Reaction products were denatured by incubating at 95 for 5 min and analyzed 
by electrophoresis on urea 6% polyacrylamide gels. 

[0148] To determine if any TdT activity was occurring in the control reaction 

of the 3 dNTP assay, which uses all 4 dNTPs, was repeated with varying 
amounts of enzyme, >600 units to 20 units, at 37°C for 30 min. For SS II, 
T197E, and Y133A, 200, 100, 50, and 20 units were used. For F309N, 646, 
200, 50, 20 units were used. 

RESULTS 

[0149] We carried out a misinsertion assay of F309N (H204R, T306K) 

Superscript II RT, hereafter referred to as F309N, with DNA template. This 
assay was employed to compare the misincorporation capability of the mutant 
to Superscript II. The assay is a primer extension assay using synthetic DNA 
template-primer and biased dNTP pools containing only three of four dNTPs. 
The reactions are displayed on a gel in Fig. 4. While conducting this procedure 
to screen for mutants with lower misensertion/misextension rates it was 
observed that SS II RT extended 2-3 nucleotides past the template end and that 
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some mutations reduced or appeared to eliminate this non-template directed 
extension or TdT activity. As shown in Fig. 4, in the presence of all 4 dNTPs, 
Superscript II RT and the mutant F309N were able to extend the primer 
approximately equally, with SS II RT adding 2 nucleotides past the template, 
and F309N adding none beyond the end of the template. To further evaluate 
this non-templated directed extension the control reaction for the 3 dNTP 
misextension assay containing all 4 dNTPs was carried out with SS II, F309N, 
T197E, and Y133A RT for 30 minutes with varying amounts of enzyme. The 
three mutants had shown very reduced levels of TdT activity in prior screens. 
Since it had been observed that 5 minutes with 20 units of enzyme was more 
than enough time for the primer extension to be completed, a 30 minute 
incubation and 200 to 646 units of RT were both in large excess over what 
was necessary for the reaction to be completed. As seen in Fig. 5, all the RT 
reactions at the lowest amount tested had similar extension products to the 
reactions at the highest unit concentrations demonstrating that the reaction had 
gone to completion. SS II RT added 2 nucleotides past the end of the 
template, F309N and T197E did not extend past the end of the template, and 
Y133A appears to have a small amount of product that is 1 nucleotide past the 
end of the template. 

EXAMPLE 4: 
Dual Thermostable and TdT Mutants. 

[0150] The F309 amino acid position in M-MLV reverse transcriptase (RT) 

aligns with the W266 position in HIV reverse transcriptase. This position is at 
the base of the thumb domain and is considered part of the minor groove 
binding tract which interacts with the minor groove of the template-primer. 
The mutations H204R and T306K have been shown to increase the 
thermostability of the enzyme. These mutations are described in U.S. 
Application No. 60/207,197, filed May 26, 2000, the disclosure of which is 
incorporated herein by reference in its entirety. The F309N mutation in an 
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H204R/T306K clone displays 2.3x lower mutation frequency in a lacZ 
forward assay (Table 3) on RNA template and shorter extension products in a 
3 dNTP extension assay than Superscript II RT or H204R/T306K in 
Superscript II RT. Both findings support the claim of an enzyme with higher 
fidelity (Table 4). 



Table 3 

Mutation Frequency of M-MLVRTHigh Fidelity Mutants 



Construct 
SSII 

SSII (H204R, T306K) 

SSII (H204R, T306K, 
F309N) 

SSII (H204R,T306K, 
F309N,V223H) 



total plaques mutant plaques 
15689 87 
14410 83 



11623 



11415 



39 



39 



MF(x 10" 4 ) 
39 
41 

17 

14 



Table 3. The mutation frequency of Superscript II RT and point mutants. 
Mutation frequency (MF) was determined by dividing the number of mutant 
plaques (light blue or white) by the total number of plaques. The background 
mutant frequency of the starting DNA was 17 x 10" 4 for the first 3 constructs 
and 20 x 10" 4 for the last construct. 

Table 4 

Error Rates of M-MLV RT High Fidelity Mutants 



M-MLV Superscript II F309N V223H/F309N 



Overall ER (oER) 1/17,000 1/15,000 1/34,000 1/41,000 
Mismatch 

% of total 46 35 68 72 

ER(mER) 1/37,000 1/42,000 1/50,000 1/58,000 

Frameshift 

% of total 46 60 21 22 

ER(rER) 1/37,000 1/25,000 1/162,000 1/188,000 
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Strand Jump 

% of total 8 5 11 6 

ERQER) 1/213,000 1/297,000 1/324,000 1/690,000 

METHODS 

[0151] Mutagenesis. Using a standard site directed mutagenesis protocol, as 

described in Example 3, a primer containg the V223H mutation was annealed 
to single strand DNA of Superscript II with the following mutations: H204R, 
T306K, F309N. The colonies were sequenced to confirm the new combination 
of V223H, H204R, T306K, and F309N. 

[0152] Selecting Colonies Containing Active RT. Colony selection was 

performed as in Example 3. 

[0153] Purification of RT mutants. Purification was performed as in 

Example 3. 

[0154] Sequencing of plaques. The plaques from the lacZ forward assay were 

transferred from the soft agar plate to Whatmann 3 MM paper and allowed to 
dry for at least 1 hour. The plaque was then punched out and the plaque/paper 
disk was added directly to a sequencing reaction mix containing 4-8 \xl ABI 
PRISM Dye Terminator Cycle Sequencing Ready Reaction (Perkin Elmer) 
1 ixl primer (GAAGATCGCACTCCAGCCAGC) (SEQ ID NO: 3) and 
distilled water to 20 |al total volume. The ABI cycle sequencing protocol was 
used for 96°C 10 seconds, 50°C 5 seconds, 60°C 4 minutes for 25 cycles. The 
paper disks were removed and the reactions were precipitated, then 
resuspended in loading dye and run on an ABI 377 sequencing machine. 

[0155] The sequences were compared to wild type lacZ alpha sequence and 

then classified as frameshift (either 1 nucleotide insertion or deletion), 
mismatch, or strand jump (an insertion or deletion between repeated 
sequences). The overall error rate for each class was determined by dividing 
the mutation frequency by the number of detectable sites (i.e., sites the 
alteration of which results in a phenotypic change) (116) multiplied by 0.5 (to 
exclude the original single strand contribution) and then multiplied by the 
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percentage of mutants observed to be in each class. ER = MF/(detectable sites 
* 0.5) * (% in each class). 
[0156] 3dNTP assay method. 3dNTP assays were performed as in Example 3. 

RESULTS 

[0157] We carried out a misinsertion assay of F309N (H204R T306K) 

Superscript reverse transcriptase, hereafter referred to as F309N, and V223H 
F309N (H204R T306K), hereafter referred to as V223H/F309N with DNA 
template. This assay was employed to compare the misincorporation capability 
of the mutant to Superscript IL The assay is a primer extension assay using 
synthetic DNA template-primer and biased dNTP pools containing only three 
of the four dNTPs. The reactions are displayed on a gel in Fig. 6 and Fig. 7. In 
this assay, higher efficiency of primer extension denotes lower fidelity. As 
shown in Figs. 6 and 7, in the presence of all 4 dNTPs, Superscript II RT and 
the mutants F309N and V223H/F309N were able to extend the primer 
approximately equally, with some variance in the addition of non-template 
directed nucleotides at the end of the primer. However when incubated with 
biased pools of nucleotides, Superscript II RT was able to catalyze substantial 
extension past template nucleotides for which a complementary dNTP was 
missing, indicating use of incorrect nucleotides and lower fidelity. In Fig. 6, 
the F309N (2) mutant showed shorter extension products than SS II RT in 
each of the biased pools of three dNTPs, indicating less ability to incorporate 
incorrect nucleotides and thus higher fidelity. In Fig. 7, the V223H/F309N 
mutant was extended with just the dATP and dCTP pools. In each case 
V223H/F309N also had lower extension products than Superscript II. This 
corresponds with the results of the lacZa assay where the F309N and 
V223H/F309N mutants had a lower mutation frequency than SS II RT 
(17xl0~ 4 and 14xl0" 4 to 39xl0" 4 ). The RT with just the H204R T306K 
mutations without F309N has a mutation frequency similar to SS II RT 
(41xl0~ 4 to 39xl0" 4 ), suggesting that these muations do not influence fidelity. 
This data shows a correlation between the misinsertion assay on DNA and the 
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lacZa assay on RNA wherein higher fidelity mutants had both shorter 
extension products with biased pools of dNTPs and lower mutation 
frequencies in the lacZa assay. 

EXAMPLE 5 
Error Rate Determination 

To determine Error Rates, mutant plaques from the lacZ forward assay 
were sequenced using known methods. The mutations were then classified 
into one of the following categories: mismatches for misinsertion events, 
frameshifts for single insertion or deletion events, or jumps for large insertions 
or deletions caused by jumping between similar sequences. An overall Error 
Rate was then determined for nucleic acid encoding the lacZ alpha peptide 
using the following equation: 

ER (error rate) = MF (mutation frequency) / (number of detectable 

sites x 0.5), where the number of detectable sites is 1 16. 

Not all bases mutated in lacZ forward assays result in a detectable 
phenotypic change. To determine specific error rates for mismatch, frameshift 
and jumps, the mutation frequency was modified by multiplying by the 
percent of the total of each mutant category, and then used to determine the 
specific error rate. The following is a sequence map of the lacZa peptide in 
M13mpl9 from Superscript II RT and the high fidelity Superscript II H203R 
T306K F309N reverse transcriptase assays. Underlining indicates deletions; 
" A " indicates insertions of the base A, T, C, or G shown above; A, T, C, or G 
shown above the complete sequence indicates mismatches. 
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Map of Superscript II 



T C 

T t TC C 

AGCGCAACGC AATTAATGTG AGTTAGCTCA CTCATTAGGC ACCCCAGGCT TTACACTTTA 
1 1 4 

CG C CC 

TGCTTCCGGC TCGTATGTTG TGTGGAATTG TGAGCGGATA ACAATTTCAC ACAGGAAACA 
1 

C 

CC CG C 

GCTATG ACC ATG ATT ACG^CCA AGC TTG CAT GCC TGC AGG TCG ACT CTA GAG GAT CCC CGG 

1 

T AAAA 
X A AAA 
T T A A 

T T T A T AC 

GTA CCG AGC TCG AAT TCA CTG GCC GTC GTT^TTA CAA CGT CGT GAC TGG GAA AAC CCT GGC 

7 111 

C TTTTT 
C TTT 
ATT 

T CC C TTCTCTTCG T 

GTT ACC CAA CTT AAT CGC CTT GCA GCA CAT CCC'CCT^TTC^GCC AGC TGG CGT 

1 4 



AAT AGC G (SEQ ID NO: 4) 



Table 5 



Insertions 


40 


38% 


60% frameshift 
(insertion or 
deletion) 


Deletions 


23 


22% 




Mismatches 


36 


35% 


35% mismatch 


Jumps 


5 


5% 


5% Jumps 



Table 6 



Overall Error Rate (oER) 


1/15,000 


(39xlCT 4 )/(116x0.5) 


Mismatch Error Rate (mER) 


1/42,500 


(0.35 x 39x1 0" 4 )/(l 16x0.5) 


Frameshift Error Rate (fER) 


1/25,000 


(0.60 x39xl0 _4 )/(116x 0.5) 


Jumps Error Rate (jER) 


1/297,000 


(0.05 x39xl0- 4 )/(116x 0.5) 
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[0160] All publications, patents and patent applications mentioned in this 

specification are indicative of the level of skill of those skilled in the art to 
which this invention pertains, and are herein incorporated by reference to the 
same extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporate by reference. 

[0161] Having now fully described the present invention in some detail by 

way of illustration and example for purposes of clarity of understanding, it 
will be obvious to one of ordinary skill in the art that the same can be 
performed by modifying or changing the invention within a wide and 
equivalent range of conditions, formulations and other parameters without 
affecting the scope of the invention or any specific embodiment thereof, and 
that such modifications or changes are intended to be encompassed within the 
scope of the appended claims. 



